DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation • Paper • 2503.10618 • Published 2 days ago • 14
YuE: Scaling Open Foundation Models for Long-Form Music Generation • Paper • 2503.08638 • Published 4 days ago • 56
Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model • Paper • 2503.07703 • Published 5 days ago • 30
EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer • Paper • 2503.07027 • Published 6 days ago • 23
VideoPainter: Any-length Video Inpainting and Editing with Plug-and-Play Context Control • Paper • 2503.05639 • Published 8 days ago • 21
GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control • Paper • 2503.03751 • Published 10 days ago • 19