Open-source speech datasets annotated using Data-Speech
Open-source annotated speech datasets ranging from 1,000 hours to 45,000 hours.
Viewer • Updated • 10.8M • 8.26k • 20Note The English version of the Multilingual LibriSpeech (MLS) dataset.
parler-tts/libritts_r_filtered
Viewer • Updated • 359k • 2.46k • 16Note Filtered version of the 1K high-quality LibriTTS-R dataset.
parler-tts/mls-eng-speaker-descriptions
Viewer • Updated • 10.8M • 995 • 3Note Annotations of English MLS above. Used for v1 training.
parler-tts/libritts-r-filtered-speaker-descriptions
Viewer • Updated • 359k • 1.04k • 3Note Annotations of the filtered LibriTTS-R dataset. Used for v1 training.
- 808
Parler-TTS
🥖High-fidelity Text-To-Speech
Natural language guidance of high-fidelity text-to-speech with synthetic annotations
Paper • 2402.01912 • Published • 11
mythicinfinity/libritts_r
Viewer • Updated • 756k • 2.31k • 26Note A 1K hours high-quality English speech dataset.
parler-tts/mls_eng_10k
Viewer • Updated • 2.43M • 1.1k • 23Note A 10K hours subset of the English version of the Multilingual LibriSpeech (MLS) dataset.
parler-tts/mls-eng-10k-tags_tagged_10k_generated
Viewer • Updated • 2.43M • 221 • 17Note Annotations of the 10K hours subset of English MLS above. Used for v0.1 training.
parler-tts/libritts_r_tags_tagged_10k_generated
Viewer • Updated • 365k • 247 • 8Note An annotated version of LibriTTS-R above. Used for v0.1 training.
parler-tts/parler_tts_mini_v0.1
Text-to-Speech • Updated • 10.1k • 350Note A first model iteration of Parler-TTS, trained using the 10k hours of narrated audiobooks above.