SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 12 days ago • 168
Towards Retrieval Augmented Generation over Large Video Libraries Paper • 2406.14938 • Published Jun 21, 2024 • 21
Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control 13 days ago • 98
Article Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial By open-r1 • 16 days ago • 35
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 25 days ago • 319
Article Yay! Organizations can now publish blog Articles By huggingface and 3 others • 27 days ago • 34
Article Gradio spaces are the perfect agent tools! By burtenshaw • about 1 month ago • 14