Zero-Shot Voice Cloning • Collection • TTS models that support zero-shot voice cloning • 7 items • Updated Oct 26, 2024 • 7
steiner-preview • Collection • Reasoning models trained on synthetic data using reinforcement learning. • 3 items • Updated Oct 20, 2024 • 25
Moshi v0.1 Release • Collection • MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18, 2024 • 225
Emu3 • Collection • Emu3: Next-Token Prediction is All You Need • 5 items • Updated 13 days ago • 67
Audio Dialogues: Dialogues dataset for audio and music understanding • Paper • 2404.07616 • Published Apr 11, 2024 • 15