MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning Paper • 2503.07365 • Published 4 days ago • 53
HMoE: Heterogeneous Mixture of Experts for Language Modeling Paper • 2408.10681 • Published Aug 20, 2024 • 9
Kimi k1.5: Scaling Reinforcement Learning with LLMs Paper • 2501.12599 • Published Jan 22 • 104
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 346
Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation Paper • 2501.12202 • Published Jan 21 • 35
Towards Best Practices for Open Datasets for LLM Training Paper • 2501.08365 • Published Jan 14 • 56
Scaling Laws for Floating Point Quantization Training Paper • 2501.02423 • Published Jan 5 • 26
PhD: A Prompted Visual Hallucination Evaluation Dataset Paper • 2403.11116 • Published Mar 17, 2024 • 3
Advancing LLM Reasoning Generalists with Preference Trees Paper • 2404.02078 • Published Apr 2, 2024 • 45
Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication Paper • 2402.18439 • Published Feb 28, 2024 • 1