MLGym: A New Framework and Benchmark for Advancing AI Research Agents Paper • 2502.14499 • Published 25 days ago • 180
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper • 2502.11089 • Published 29 days ago • 143
CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction Paper • 2502.07316 • Published Feb 11 • 47
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published Jan 8 • 263
Offline Reinforcement Learning for LLM Multi-Step Reasoning Paper • 2412.16145 • Published Dec 20, 2024 • 38
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters Paper • 2408.03314 • Published Aug 6, 2024 • 58
GitChameleon: Unmasking the Version-Switching Capabilities of Code Generation Models Paper • 2411.05830 • Published Nov 5, 2024 • 21
Running on CPU Upgrade 12.8k 12.8k Open LLM Leaderboard 🏆 Track, rank and evaluate open LLMs and chatbots
neuralmagic/Meta-Llama-3.1-70B-Instruct-quantized.w4a16 Text Generation • Updated Feb 12 • 12.6k • 31