TensorDictModule

class tensordict.nn.TensorDictModule(*args, **kwargs)

TensorDictModule is a Python wrapper around an nn.Module that reads from and writes to a TensorDict.

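Parameters:

module (Callable[[Any], Any]) – a callable, typically a torch.nn.Module, used to map the input to the output parameter space. Its forward method can return a single tensor, a tuple of tensors, or a dictionary. In the dictionary case, the out_keys of the TensorDictModule are used to populate the output tensordict (i.e. every key in out_keys should be present in the dictionary returned by the module's forward method).
in_keys (iterable of NestedKeys, Dict[NestedStr, str]) – keys to be read from the input tensordict and passed to the module. If it contains more than one element, the values are passed in the order given by the in_keys iterable. If in_keys is a dictionary, its keys must correspond to the keys to be read in the tensordict and its values must match the names of the keyword arguments in the function signature. If out_to_in_map is True, the mapping is inverted so that the dictionary keys correspond to the keyword arguments in the function signature.
out_keys (iterable of str) – keys to be written to the input tensordict. The length of out_keys must match the number of tensors returned by the embedded module. Using "_" as a key avoids writing the corresponding tensor to the output.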

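Keyword Arguments:

out_to_in_map (bool, optional) – if True (default), in_keys is read as if its keys are the argument names of the forward() method and its values are the keys in the input TensorDict. If False, the keys are treated as the input keys and the values as the method's argument names.
inplace (bool or string, optional) –
if True (default), the output of the module is written to the tensordict passed to the forward() method. If False, a new TensorDict with an empty batch size and no device is created. If "empty", empty() is used to create the output tensordict.

Note

If inplace=False and the tensordict passed to the module is a TensorDictBase subclass other than TensorDict, the output will still be a TensorDict instance. Its batch size will be empty and it will have no device. Set inplace to "empty" to get the same TensorDictBase subtype with the same batch size and device. Use tensordict_out at runtime (see below) for finer-grained control over the output.

Note

If inplace=False and a tensordict_out is passed to the forward() method, tensordict_out takes precedence. This is how to control the output container when the tensordict passed to the module is a TensorDictBase subclass other than TensorDict, since without tensordict_out the output would still be a TensorDict instance.
method (str, optional) – the method to be called in the module, if any. Defaults to __call__.
method_kwargs (Dict[str, Any], optional) – additional keyword arguments to pass to the module method being called.
strict (bool, optional) – if True, the module raises an exception if any of the inputs is missing from the input tensordict. Otherwise, a None value is used as a placeholder. Defaults to False.
get_kwargs (dict[str, Any], optional) – additional keyword arguments to pass to the get() method. This is particularly useful when dealing with ragged tensors (see get()). Defaults to {}.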

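Embedding a neural network in a TensorDictModule only requires specifying the input and output keys. TensorDictModule supports both functional and regular nn.Module objects. In the functional case, the 'params' (and 'buffers') keyword arguments must be specified.

Example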

>>> import torch
>>> from torch import nn
>>> from tensordict import TensorDict
>>> from tensordict.nn import TensorDictModule
>>> # one can wrap a regular nn.Module
>>> module = TensorDictModule(nn.Transformer(128), in_keys=["input", "tgt"], out_keys=["out"])
>>> input = torch.ones(2, 3, 128)
>>> tgt = torch.zeros(2, 3, 128)
>>> data = TensorDict({"input": input, "tgt": tgt}, batch_size=[2, 3])
>>> data = module(data)
>>> print(data)
TensorDict(
    fields={
        input: Tensor(shape=torch.Size([2, 3, 128]), device=cpu, dtype=torch.float32, is_shared=False),
        out: Tensor(shape=torch.Size([2, 3, 128]), device=cpu, dtype=torch.float32, is_shared=False),
        tgt: Tensor(shape=torch.Size([2, 3, 128]), device=cpu, dtype=torch.float32, is_shared=False)},
    batch_size=torch.Size([2, 3]),
    device=None,
    is_shared=False)

Tensors can also be passed directly.

Example

>>> out = module(input, tgt)
>>> assert out.shape == input.shape
>>> # we can also wrap regular functions
>>> module = TensorDictModule(lambda x: (x-1, x+1), in_keys=[("input", "x")], out_keys=[("output", "x-1"), ("output", "x+1")])
>>> module(TensorDict({("input", "x"): torch.zeros(())}, batch_size=[]))
TensorDict(
    fields={
        input: TensorDict(
            fields={
                x: Tensor(shape=torch.Size([]), device=cpu, dtype=torch.float32, is_shared=False)},
            batch_size=torch.Size([]),
            device=None,
            is_shared=False),
        output: TensorDict(
            fields={
                x+1: Tensor(shape=torch.Size([]), device=cpu, dtype=torch.float32, is_shared=False),
                x-1: Tensor(shape=torch.Size([]), device=cpu, dtype=torch.float32, is_shared=False)},
            batch_size=torch.Size([]),
            device=None,
            is_shared=False)},
    batch_size=torch.Size([]),
    device=None,
    is_shared=False)

A TensorDictModule can also be used to populate a tensordict.

Example

>>> module = TensorDictModule(lambda: torch.randn(3), in_keys=[], out_keys=["x"])
>>> print(module(TensorDict({}, batch_size=[])))
TensorDict(
    fields={
        x: Tensor(shape=torch.Size([3]), device=cpu, dtype=torch.float32, is_shared=False)},
    batch_size=torch.Size([]),
    device=None,
    is_shared=False)

Another feature is passing a dictionary as input keys, which controls how values are dispatched to specific keyword arguments.

Example

>>> module = TensorDictModule(lambda x, *, y: x+y,
...     in_keys={'1': 'x', '2': 'y'}, out_keys=['z'], out_to_in_map=False
...     )
>>> td = module(TensorDict({'1': torch.ones(()), '2': torch.ones(())*2}, []))
>>> td['z']
tensor(3.)

If out_to_in_map is set to True, the in_keys mapping is inverted. This makes it possible to use the same input key for several keyword arguments.

Example

>>> module = TensorDictModule(lambda x, *, y, z: x+y+z,
...     in_keys={'x': '1', 'y': '2', 'z': '2'}, out_keys=['t'], out_to_in_map=True
...     )
>>> td = module(TensorDict({'1': torch.ones(()), '2': torch.ones(())*2}, []))
>>> td['t']
tensor(5.)

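The method to be called within the module can be specified. Compared with wrapping the module method in a lambda or similar function, this has the advantage that the module attributes (params, buffers, submodules) remain exposed.

Example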

>>> from tensordict import TensorDict
>>> from tensordict.nn import TensorDictSequential as Seq, TensorDictModule as Mod
>>> from torch import nn
>>> import torch
>>>
>>> class MyNet(nn.Module):
...     def my_func(self, tensor: torch.Tensor, *, an_integer: int):
...         return tensor + an_integer
...
>>> s = Seq(
...     {
...         "a": lambda td: td+1,
...         "b": lambda td: td * 2,
...         "c": Mod(MyNet(), in_keys=["a"], out_keys=["b"], method="my_func", method_kwargs={"an_integer": 2}),
...     }
... )
>>> td = s(TensorDict(a=0))
>>> print(td)
>>>
>>> assert td["b"] == 4

Functional calls to a tensordict module are straightforward.

Example

>>> import torch
>>> from tensordict import TensorDict
>>> from tensordict.nn import TensorDictModule
>>> td = TensorDict({"input": torch.randn(3, 4), "hidden": torch.randn(3, 8)}, [3,])
>>> module = torch.nn.GRUCell(4, 8)
>>> td_module = TensorDictModule(
...    module=module, in_keys=["input", "hidden"], out_keys=["output"]
... )
>>> params = TensorDict.from_module(td_module)
>>> # functional API
>>> with params.to_module(td_module):
...     td_functional = td_module(td.clone())
>>> print(td_functional)
TensorDict(
    fields={
        hidden: Tensor(shape=torch.Size([3, 8]), device=cpu, dtype=torch.float32, is_shared=False),
        input: Tensor(shape=torch.Size([3, 4]), device=cpu, dtype=torch.float32, is_shared=False),
        output: Tensor(shape=torch.Size([3, 8]), device=cpu, dtype=torch.float32, is_shared=False)},
    batch_size=torch.Size([3]),
    device=None,
    is_shared=False)

In the stateful case:

>>> module = torch.nn.GRUCell(4, 8)
>>> td_module = TensorDictModule(
...    module=module, in_keys=["input", "hidden"], out_keys=["output"]
... )
>>> td_stateful = td_module(td.clone())
>>> print(td_stateful)
TensorDict(
    fields={
        hidden: Tensor(shape=torch.Size([3, 8]), device=cpu, dtype=torch.float32, is_shared=False),
        input: Tensor(shape=torch.Size([3, 4]), device=cpu, dtype=torch.float32, is_shared=False),
        output: Tensor(shape=torch.Size([3, 8]), device=cpu, dtype=torch.float32, is_shared=False)},
    batch_size=torch.Size([3]),
    device=None,
    is_shared=False)

forward(tensordict: TensorDictBase = None, args=None, *, tensordict_out: tensordict.base.TensorDictBase | None = None, **kwargs: Any) → TensorDictBase

When the tensordict parameter is not set, kwargs are used to create an instance of TensorDict.
