Llasa
Collection
TTS foundation model compatible with the Llama framework (160k hours of tokenized speech data released)
11 items
Update (2025-02-13): Added Llasa fine-tuning instructions.
These models are not mentioned in the original paper. They are essentially the same as LLaSA 1B and LLaSA 3B, except that they have been fine-tuned on a mixed speech and text SFT dataset, which lets them retain text-based conversational abilities.
LLaSA: Scaling Train-Time and Inference-Time Compute for LLaMA-based Speech Synthesis
Base model: meta-llama/Llama-3.2-1B-Instruct