Wenetspeech4TTS
/

Amphion-Valle-Wenetspeech4TTS

Model card Files Files and versions Community

The vanilla VALL E train on WenetSpeech4TTS using Amphion tooltik.

The entire training process follows its training code, except that the text-to-phoneme feature step is slightly different.

Checkpoints

base_model.bin : VALL-E trained with the WenetSpeech4TTS Basic subset
38sft_model.bin : VALL-E Basic fine-tuning with the WenetSpeech4TTS Standard subset
4sft_model.bin : VALL-E Standard fine-tuning with the WenetSpeech4TTS Premium subset

usage

Inference code and more details : ISCSLP2024_CoVoC_baseline.

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Examples

Unable to determine this model's library. Check the docs .

Dataset used to train Wenetspeech4TTS/Amphion-Valle-Wenetspeech4TTS