HARP: A challenging human-annotated math reasoning benchmark Paper • 2412.08819 • Published Dec 11, 2024 • 2
WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training Paper • 2501.18511 • Published 21 days ago • 19 • 4