PyTorch: Tensors

Created On: Dec 03, 2020 | Last Updated: Sep 29, 2025 | Last Verified: Nov 05, 2024

A third order polynomial, trained to predict \(y=\sin(x)\) from \(-\pi\) to \(\pi\) by minimizing squared Euclidean distance.

This implementation uses PyTorch Tensors to manually compute the forward pass, loss, and backward pass.
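
For reference, with weights \(a, b, c, d\) the forward pass and squared-error loss in the code below are

\[
\hat{y} = a + bx + cx^2 + dx^3, \qquad
L = \sum_i \left(\hat{y}_i - y_i\right)^2,
\]

and the backward pass applies the chain rule by hand:

\[
\frac{\partial L}{\partial a} = \sum_i 2\left(\hat{y}_i - y_i\right), \quad
\frac{\partial L}{\partial b} = \sum_i 2\left(\hat{y}_i - y_i\right) x_i, \quad
\frac{\partial L}{\partial c} = \sum_i 2\left(\hat{y}_i - y_i\right) x_i^2, \quad
\frac{\partial L}{\partial d} = \sum_i 2\left(\hat{y}_i - y_i\right) x_i^3.
\]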

A PyTorch Tensor is basically the same as a numpy array: it does not know anything about deep learning or computational graphs or gradients, and is just a generic n-dimensional array to be used for arbitrary numeric computation.
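
As a quick illustration of that equivalence (a minimal sketch; the array and variable names here are just for demonstration, and note that torch.from_numpy shares memory with the source array):

import numpy as np
import torch

a_np = np.linspace(-1.0, 1.0, 5)   # an ordinary numpy array
t = torch.from_numpy(a_np)         # a Tensor viewing the same memory
t2 = t * 2 + 1                     # generic numeric computation, no autograd involved
print(t2.numpy())                  # back to numpy: [-1.  0.  1.  2.  3.]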

The biggest difference between a PyTorch Tensor and a numpy array is that a PyTorch Tensor can run on either CPU or GPU. To run operations on the GPU, just cast the Tensor to a cuda datatype.
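
Since CUDA may or may not be available at runtime, a common pattern (a sketch, not part of the script below) is to select the device once and create Tensors directly on it:

import torch

device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")

x = torch.randn(3, device=device)  # created directly on the chosen device
y = torch.ones(3).to(device)       # or moved there after creation
print((x + y).device)              # cuda:0 if a GPU was found, otherwise cpu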

Output of the script (the loss is printed every 100 iterations, followed by the fitted polynomial):

99 949.525146484375
199 632.3920288085938
299 422.2398376464844
399 282.9600830078125
499 190.6369171142578
599 129.4294891357422
699 88.84382629394531
799 61.92726135253906
899 44.072265625
999 32.22596740722656
1099 24.364370346069336
1199 19.146005630493164
1299 15.681275367736816
1399 13.380207061767578
1499 11.851539611816406
1599 10.835738182067871
1699 10.160508155822754
1799 9.711515426635742
1899 9.412845611572266
1999 9.214092254638672
Result: y = -0.0071981302462518215 + 0.8385370969772339 x + 0.0012417974648997188 x^2 + -0.09074106812477112 x^3

import torch
import math


dtype = torch.float
device = torch.device("cpu")
# device = torch.device("cuda:0") # Uncomment this to run on GPU

# Create random input and output data
x = torch.linspace(-math.pi, math.pi, 2000, device=device, dtype=dtype)
y = torch.sin(x)

# Randomly initialize weights
a = torch.randn((), device=device, dtype=dtype)
b = torch.randn((), device=device, dtype=dtype)
c = torch.randn((), device=device, dtype=dtype)
d = torch.randn((), device=device, dtype=dtype)

learning_rate = 1e-6
for t in range(2000):
    # Forward pass: compute predicted y
    y_pred = a + b * x + c * x ** 2 + d * x ** 3

    # Compute and print loss
    loss = (y_pred - y).pow(2).sum().item()
    if t % 100 == 99:
        print(t, loss)

    # Backprop to compute gradients of a, b, c, d with respect to loss
    grad_y_pred = 2.0 * (y_pred - y)
    grad_a = grad_y_pred.sum()
    grad_b = (grad_y_pred * x).sum()
    grad_c = (grad_y_pred * x ** 2).sum()
    grad_d = (grad_y_pred * x ** 3).sum()

    # Update weights using gradient descent
    a -= learning_rate * grad_a
    b -= learning_rate * grad_b
    c -= learning_rate * grad_c
    d -= learning_rate * grad_d


print(f'Result: y = {a.item()} + {b.item()} x + {c.item()} x^2 + {d.item()} x^3')
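
As a sanity check on the hand-derived gradients, one can append a step after the script above that recomputes the loss on leaf Tensors with requires_grad=True and compares against what autograd produces. This is a sketch, not part of the original script, and the a2/b2/c2/d2 names are hypothetical:

# Re-run one forward pass with autograd enabled on copies of the weights
a2 = a.clone().detach().requires_grad_(True)
b2 = b.clone().detach().requires_grad_(True)
c2 = c.clone().detach().requires_grad_(True)
d2 = d.clone().detach().requires_grad_(True)

y_pred = a2 + b2 * x + c2 * x ** 2 + d2 * x ** 3
loss = (y_pred - y).pow(2).sum()
loss.backward()  # autograd fills a2.grad, b2.grad, c2.grad, d2.grad

# The manual formulas from the training loop above
grad_y_pred = 2.0 * (y_pred.detach() - y)
print(torch.allclose(a2.grad, grad_y_pred.sum()))             # expected: True
print(torch.allclose(d2.grad, (grad_y_pred * x ** 3).sum()))  # expected: True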

Total running time of the script: (0 minutes 0.213 seconds)