Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model Paper • 2503.07703 • Published 7 days ago • 30
Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning Paper • 2503.07572 • Published 7 days ago • 34
YuE: Scaling Open Foundation Models for Long-Form Music Generation Paper • 2503.08638 • Published 6 days ago • 56
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia Paper • 2503.07920 • Published 6 days ago • 91
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 5 days ago • 284
R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcing Learning Paper • 2503.05379 • Published 10 days ago • 32
WritingBench: A Comprehensive Benchmark for Generative Writing Paper • 2503.05244 • Published 10 days ago • 15
TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation Paper • 2503.04872 • Published 11 days ago • 14
MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval Paper • 2412.14475 • Published Dec 19, 2024 • 55
Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers Paper • 2503.00865 • Published 15 days ago • 58
DiffRhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion Paper • 2503.01183 • Published 14 days ago • 26
Qilin: A Multimodal Information Retrieval Dataset with APP-level User Sessions Paper • 2503.00501 • Published 16 days ago • 11
BFS-Prover: Scalable Best-First Tree Search for LLM-based Automatic Theorem Proving Paper • 2502.03438 • Published Feb 5 • 2