DiffRhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion Paper • 2503.01183 • Published 5 days ago • 25
Article Wan 2.1 by Wan AI: best cost efficient video generation model Now Available By LLMhacker • 11 days ago • 26
Article Mastering Long Contexts in LLMs with KVPress By nvidia and 1 other • Jan 23 • 64
FuseChat 3.0 Collection Preference Optimization for Implicit Model Fusion • 14 items • Updated about 23 hours ago • 13
Article FuseChat-3.0: Preference Optimization for Implicit Model Fusion By Wanfq and 2 others • Dec 18, 2024 • 5
SmolVLM 256M & 500M Collection Collection for models & demos for even smoller SmolVLM release • 12 items • Updated 16 days ago • 69
Article FuseO1-Preview: System-II Reasoning Fusion of LLMs By Wanfq and 4 others • Jan 20 • 20
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated 16 days ago • 244
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper • 2406.17557 • Published Jun 25, 2024 • 93