As part of some ongoing work, I'm releasing what is currently the largest collection of Docker containers for state-of-the-art voice-cloning TTS systems (https://github.com/ttsds/datasets). Alongside it, there is also a nice overview of all systems (see below).
Kokoro: a small, fast 80M param TTS model hosted on ZeroGPU at hexgrad/Kokoro-TTS
Cosmos Tokenizer Collection A suite of image and video tokenizers • 10 items • Updated 19 days ago • 20
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated 11 days ago • 274
Lina-Speech: Gated Linear Attention is a Fast and Parameter-Efficient Learner for text-to-speech synthesis Paper • 2410.23320 • Published 27 days ago • 6