TACOTRON2_WAVERNN_CHAR_LJSPEECH¶

torchaudio.pipelines.TACOTRON2_WAVERNN_CHAR_LJSPEECH¶

基于字符的 TTS 流水线，使用在 LJSpeech 数据集上训练的 Tacotron2 模型（训练 1,500 个 epoch）[Ito and Johnson, 2017]，以及 WaveRNN vocoder 模型，使用 8 位深度的 LJSpeech 波形数据训练（训练 10,000 个 epoch）[Ito and Johnson, 2017]。

文本处理器逐个字符地编码输入文本。

您可以在此处找到训练脚本。使用的参数如下：win_length=1100、hop_length=275、n_fft=2048、mel_fmin=40 和 mel_fmax=11025。

您可以在此处找到训练脚本。

有关用法，请参考 torchaudio.pipelines.Tacotron2TTSBundle()。

示例 - “Hello world! T T S stands for Text to Speech!”

示例 - “The examination and testimony of the experts enabled the Commission to conclude that five shots may have been fired,”

TACOTRON2_WAVERNN_CHAR_LJSPEECH¶

文档

教程

资源