cuda_ctc_decoder¶

torchaudio.models.decoder.cuda_ctc_decoder(tokens: Union[str, List[str]], nbest: int = 1, beam_size: int = 10, blank_skip_threshold: float = 0.95) → CUCTCDecoder[源代码]¶

已弃用

警告

此函数已被弃用。它将在 2.9 版本中被移除。此次弃用是旨在将 TorchAudio 迁移到维护阶段的大规模重构工作的一部分。请参阅 https://github.com/pytorch/audio/issues/3902 获取更多信息。

构建一个 CUCTCDecoder 实例。

参数

tokens (str 或 List[str]) – 包含有效 token 的文件或列表。如果使用文件，则期望的格式是映射到相同索引的 token 位于同一行。
beam_size (int, 可选) – 每个解码步骤后要保留的最大假设数量（默认值：10）。
nbest (int) – 要返回的最佳解码数量。
blank_id (int) – 与空白符号对应的 token ID。
blank_skip_threshold (float) – 如果 log_prob(blank) > log(blank_skip_threshold)，则跳过帧以加速解码（默认值：0.95）。

返回

解码器

返回类型

CUCTCDecoder

示例

>>> decoder = cuda_ctc_decoder(
>>>     vocab_file="tokens.txt",
>>>     blank_skip_threshold=0.95,
>>> )
>>> results = decoder(log_probs, encoder_out_lens) # List of shape (B, nbest) of Hypotheses

cuda_ctc_decoder¶

文档

教程

资源