Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 2 days ago • 219
Running 2.24k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
Reasoning Datasets Collection Distilled synthetic Reasoning datasets • 7 items • Updated Feb 2 • 56
🧠Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 14 items • Updated 2 days ago • 101
Post 2046 New smolagents example landed in the Hugging Face cookbook 🤠 Learn how to create an inventory-management multi-agent system with smolagents, MongoDB and DeepSeek Chat 📖 https://huggingface.co/learn/cookbook/mongodb_smolagents_multi_micro_agents
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training Paper • 2501.11425 • Published Jan 20 • 93
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation • Updated 18 days ago • 1.59M • 1.26k
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B Text Generation • Updated 18 days ago • 1.59M • 1.03k
Running 18 TravelPlannerLeaderboard 💻 Display and submit evaluation results for travel planning
Smaller Language Models Are Better Instruction Evolvers Paper • 2412.11231 • Published Dec 15, 2024 • 27