EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions Paper • 2402.17485 • Published Feb 27 • 184
Seed-TTS: A Family of High-Quality Versatile Speech Generation Models Paper • 2406.02430 • Published 25 days ago • 27
Audio Mamba: Bidirectional State Space Model for Audio Representation Learning Paper • 2406.03344 • Published 24 days ago • 15
VideoTetris: Towards Compositional Text-to-Video Generation Paper • 2406.04277 • Published 23 days ago • 21