DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 7 days ago • 260
Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation Paper • 2501.12202 • Published 8 days ago • 28
Towards Best Practices for Open Datasets for LLM Training Paper • 2501.08365 • Published 15 days ago • 50
Scaling Laws for Floating Point Quantization Training Paper • 2501.02423 • Published 25 days ago • 25
PhD: A Prompted Visual Hallucination Evaluation Dataset Paper • 2403.11116 • Published Mar 17, 2024 • 1
HMoE: Heterogeneous Mixture of Experts for Language Modeling Paper • 2408.10681 • Published Aug 20, 2024 • 8
Advancing LLM Reasoning Generalists with Preference Trees Paper • 2404.02078 • Published Apr 2, 2024 • 44
Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication Paper • 2402.18439 • Published Feb 28, 2024
AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors Paper • 2308.10848 • Published Aug 21, 2023 • 1
ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs Paper • 2307.16789 • Published Jul 31, 2023 • 99
Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models Paper • 2403.08281 • Published Mar 13, 2024
Boosting Inference Efficiency: Unleashing the Power of Parameter-Shared Pre-trained Language Models Paper • 2310.12818 • Published Oct 19, 2023