ESPnet-EZ: Python-only ESPnet for Easy Fine-tuning and Integration Paper • 2409.09506 • Published Sep 14, 2024 • 4
discrete-speech/interspeech2024_discrete_speech_svs_results Viewer • Updated Aug 23, 2024 • 8 • 10
Towards Robust Speech Representation Learning for Thousands of Languages Paper • 2407.00837 • Published Jun 30, 2024 • 10
discrete-speech/interspeech2024_discrete_speech_tts_results Viewer • Updated Mar 22, 2024 • 10 • 10
discrete-speech/interspeech2024_discrete_speech_tts_1h_results Viewer • Updated Mar 22, 2024 • 8 • 7
discrete-speech/interspeech2024_discrete_speech_vocoder_results Preview • Updated Mar 18, 2024 • 36
discrete-speech/interspeech2024_discrete_speech_asr_results Viewer • Updated Mar 17, 2024 • 13 • 31
OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer Paper • 2401.16658 • Published Jan 30, 2024 • 13
Music ControlNet: Multiple Time-varying Controls for Music Generation Paper • 2311.07069 • Published Nov 13, 2023 • 43