Linear#

class torch.nn.Linear(in_features, out_features, bias=True, device=None, dtype=None)[源码]#

将仿射线性变换应用于输入数据： $y = xA^T + b$ .

此模块支持 TensorFloat32。

在某些 ROCm 设备上，当使用 float16 输入时，此模块将对反向传播使用不同精度。

参数

形状

输入： $(*, H_\text{in})$ ，其中 $*$ 表示任意数量的维度（包括零个维度），并且 $H_\text{in} = \text{in\_features}$ 。
输出： $(*, H_\text{out})$ ，其中除最后一个维度以外的所有维度都与输入形状相同，并且 $H_\text{out} = \text{out\_features}$ 。

变量

weight (torch.Tensor) – 模块的可学习权重，形状为 $(\text{out\_features}, \text{in\_features})$ 。值从 $\mathcal{U}(-\sqrt{k}, \sqrt{k})$ ，其中 $k = \frac{1}{\text{in\_features}}$
bias – 模块的可学习偏置，形状为 $(\text{out\_features})$ 。如果 bias 为 True，则值从 $\mathcal{U}(-\sqrt{k}, \sqrt{k})$ ，其中 $k = \frac{1}{\text{in\_features}}$

示例

>>> m = nn.Linear(20, 30)
>>> input = torch.randn(128, 20)
>>> output = m(input)
>>> print(output.size())
torch.Size([128, 30])

extra_repr()[源码]#

返回模块的额外表示。

forward(input)[源码]#

执行前向传播。

reset_parameters()[源码]#

根据 __init__ 中使用的初始化重置参数。

文档