Collections
Discover the best community collections!
Collections including paper arxiv:2311.10982
-
AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort
Paper • 2311.11243 • Published • 14 -
Make Pixels Dance: High-Dynamic Video Generation
Paper • 2311.10982 • Published • 68 -
Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression
Paper • 2311.10794 • Published • 24 -
ShareGPT4V: Improving Large Multi-Modal Models with Better Captions
Paper • 2311.12793 • Published • 18
-
Make Pixels Dance: High-Dynamic Video Generation
Paper • 2311.10982 • Published • 68 -
Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning
Paper • 2311.10709 • Published • 24 -
AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort
Paper • 2311.11243 • Published • 14 -
LivePhoto: Real Image Animation with Text-guided Motion Control
Paper • 2312.02928 • Published • 16
-
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages
Paper • 2309.09400 • Published • 82 -
Contrastive Decoding Improves Reasoning in Large Language Models
Paper • 2309.09117 • Published • 37 -
FreeU: Free Lunch in Diffusion U-Net
Paper • 2309.11497 • Published • 64 -
A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models
Paper • 2309.11674 • Published • 31
-
OmnimatteRF: Robust Omnimatte with 3D Background Modeling
Paper • 2309.07749 • Published • 7 -
AudioSR: Versatile Audio Super-resolution at Scale
Paper • 2309.07314 • Published • 24 -
Generative Image Dynamics
Paper • 2309.07906 • Published • 52 -
MagiCapture: High-Resolution Multi-Concept Portrait Customization
Paper • 2309.06895 • Published • 27