Int8DynamicActivationInt4WeightConfig¶

class torchao.quantization.Int8DynamicActivationInt4WeightConfig(group_size: int = 32, layout: Layout = PlainLayout(), mapping_type: MappingType = MappingType.SYMMETRIC, act_mapping_type: MappingType = MappingType.ASYMMETRIC, set_inductor_config: bool = True)[源代码]¶

用于将 int8 动态每 token 非对称激活量化和 int4 每组权重对称量化应用于线性层的配置。这用于为 executorch 后端生成模型，但目前 executorch 尚不支持对此流程产生的量化模型的降低。

参数: