Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis Paper • 2502.04128 • Published 4 days ago • 15
Llasa Collection TTS foundation model compatible with Llama framework (160k hours tokenized speech data released) • 7 items • Updated 3 days ago • 5
view article Article The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about... By srinivasbilla • 20 days ago • 60
Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model Paper • 2408.17175 • Published Aug 30, 2024 • 2