ShareGPT4Video: Improving Video Understanding and Generation with Better Captions Paper • 2406.04325 • Published 25 days ago • 69
MoE-LLaVA: Mixture of Experts for Large Vision-Language Models Paper • 2401.15947 • Published Jan 29 • 47