Ovis2 Collection Our latest advancement in multi-modal large language models (MLLMs) • 7 items • Updated 5 days ago • 36
Running on Zero 1.76k 1.76k Chat With Janus-Pro-7B 🌍 A unified multimodal understanding and generation model.
FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces Paper • 2501.12909 • Published 25 days ago • 67 • 3
FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces Paper • 2501.12909 • Published 25 days ago • 67
FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces Paper • 2501.12909 • Published 25 days ago • 67
Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks Paper • 2501.11733 • Published 27 days ago • 28
KaLM-Embedding: Superior Training Data Brings A Stronger Embedding Model Paper • 2501.01028 • Published Jan 2 • 13
Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments Paper • 2501.10893 • Published 29 days ago • 24
UI-TARS: Pioneering Automated GUI Interaction with Native Agents Paper • 2501.12326 • Published 26 days ago • 50
PaSa: An LLM Agent for Comprehensive Academic Paper Search Paper • 2501.10120 • Published about 1 month ago • 43
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published Jan 14 • 273
HIT-TMG/KaLM-embedding-multilingual-mini-instruct-v1.5 Sentence Similarity • Updated Jan 3 • 74.4k • • 55