Article Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset By sdiazlor • 6 days ago • 30
Article Introducing smolagents: simple agents that write actions in code. Dec 31, 2024 • 664
Scaling Test-Time Compute with Open Models Collection Models and datasets used in our blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute • 10 items • Updated Jan 6 • 23
NuminaMath Collection Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 7 items • Updated 6 days ago • 73
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 11 items • Updated 5 days ago • 64
Llama 3.2 Collection This collection hosts the transformers and original repos of Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 569