Linear#

class torch.nn.modules.linear.Linear(in_features, out_features, bias=True, device=None, dtype=None)[源代码]#

对输入数据应用仿射线性变换： $y = xA^T + b$ .

此模块支持 TensorFloat32。

在某些 ROCm 设备上，当使用 float16 输入时，此模块将对反向传播使用不同精度。

参数

形状

输入： $(*, H_\text{in})$ ，其中 $*$ 表示任意数量的维度（包括零个），并且 $H_\text{in} = \text{in\_features}$ 。
输出： $(*, H_\text{out})$ ，其中除了最后一个维度外，所有维度都与输入形状相同，并且 $H_\text{out} = \text{out\_features}$ 。

变量

weight (torch.Tensor) – 模块的可学习权重，形状为 $(\text{out\_features}, \text{in\_features})$ 。其值从 $\mathcal{U}(-\sqrt{k}, \sqrt{k})$ 初始化，其中 $k = \frac{1}{\text{in\_features}}$ 。
bias – 模块的可学习偏差，形状为 $(\text{out\_features})$ 。如果 bias 为 True，则值从 $\mathcal{U}(-\sqrt{k}, \sqrt{k})$ 初始化，其中 $k = \frac{1}{\text{in\_features}}$ 。

示例

>>> m = nn.Linear(20, 30)
>>> input = torch.randn(128, 20)
>>> output = m(input)
>>> print(output.size())
torch.Size([128, 30])

返回模块的额外表示。

forward(input)[源代码]#

执行前向传播。

reset_parameters()[源代码]#

根据 __init__ 中使用的初始化重置参数。

文档