Kimi k1.5: Scaling Reinforcement Learning with LLMs • Paper • 2501.12599 • Published 4 days ago • 56
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning • Paper • 2501.12948 • Published 4 days ago • 190
Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps • Paper • 2501.09732 • Published 10 days ago • 65
Towards Best Practices for Open Datasets for LLM Training • Paper • 2501.08365 • Published 12 days ago • 47
MangaNinja: Line Art Colorization with Precise Reference Following • Paper • 2501.08332 • Published 12 days ago • 55
MiniMax-01: Scaling Foundation Models with Lightning Attention • Paper • 2501.08313 • Published 12 days ago • 268
O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning • Paper • 2501.06458 • Published 15 days ago • 29
Reasoning Datasets • Collection • Reasoning datasets that are trending 🔥 • 10 items • Updated 23 days ago • 24
Agent Laboratory: Using LLM Agents as Research Assistants • Paper • 2501.04227 • Published 19 days ago • 81
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking • Paper • 2501.04519 • Published 18 days ago • 248
Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search • Paper • 2412.18319 • Published Dec 24, 2024 • 37
Large Language Monkeys: Scaling Inference Compute with Repeated Sampling • Paper • 2407.21787 • Published Jul 31, 2024 • 12
Solving math word problems with process- and outcome-based feedback • Paper • 2211.14275 • Published Nov 25, 2022 • 8