69 309 306

Aymeric Roucher

m-ric

http://aymeric-roucher.github.io

AI & ML interests

Leading Agents at Hugging Face 🤗

Recent Activity

updated a dataset 19 minutes ago

smolagents-benchmark/answers

upvoted an article 24 minutes ago

Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥

updated a Space about 7 hours ago

smolagents-benchmark/smolagents_llm_leaderboard

View all activity

Organizations

m-ric's activity

updated a dataset 19 minutes ago

smolagents-benchmark/answers

Viewer • Updated 19 minutes ago • 2.38k • 35

upvoted an article 24 minutes ago

Article

Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥

about 22 hours ago

• 26

updated a Space about 7 hours ago

smolagents LLM leaderboard

🏆

Generate custom web content using React

liked a dataset about 8 hours ago

shuaishuaicdp/GUI-World

Preview • Updated Jun 23, 2024 • 1.37k • 21

posted an update about 10 hours ago

Post

326

Less is More for Reasoning (LIMO): a 32B model fine-tuned with 817 examples can beat o1-preview on math reasoning! 🤯

Do we really need o1's huge RL procedure to see reasoning emerge? It seems not.
Researchers from Shanghai Jiaotong University just demonstrated that carefully selected examples can boost math performance in large language models using SFT —no huge datasets or RL procedures needed.

Their procedure allows Qwen2.5-32B-Instruct to jump from 6.5% to 57% on AIME and from 59% to 95% on MATH, while using only 1% of the data in previous approaches.

⚡ The Less-is-More Reasoning Hypothesis:
‣ Minimal but precise examples that showcase optimal reasoning patterns matter more than sheer quantity
‣ Pre-training knowledge plus sufficient computational resources at inference levels up math skills

➡️ Core techniques:
‣ High-quality reasoning chains with self-verification steps
‣ 817 handpicked problems that encourage deeper reasoning
‣ Enough inference-time computation to allow extended reasoning

💪 Efficiency gains:
‣ Only 817 examples instead of 100k+
‣ 40.5% absolute improvement across 10 diverse benchmarks, outperforming models trained on 100x more data

This really challenges the notion that SFT leads to memorization rather than generalization! And opens up reasoning to GPU-poor researchers 🚀

Read the full paper here 👉 LIMO: Less is More for Reasoning (2502.03387)

liked a model about 11 hours ago

microsoft/OmniParser-v2.0

Image-Text-to-Text • Updated about 16 hours ago • 1.4k • 559

New activity in m-ric/open_Deep-Research 2 days ago

added sidebar layout

#19 opened 2 days ago by

ysharma

liked a Space 4 days ago

Notebook To Markdown Converter

🌖

Converts jupyter notebooks to markdown files.

updated a Space 4 days ago

Notebook To Markdown Converter

🌖

Converts jupyter notebooks to markdown files.

published a Space 4 days ago

Notebook To Markdown Converter

🌖

Converts jupyter notebooks to markdown files.

upvoted an article 4 days ago

Article

1 Billion Classifications

6 days ago

• 37

commented on Welcome Fireworks.ai on the Hub 🎆 4 days ago

Amazing, looking forward to useing it!

upvoted an article 4 days ago

Article

Welcome Fireworks.ai on the Hub 🎆

5 days ago

• 45

posted an update 4 days ago

Post

2460

𝗚𝗿𝗲𝗮𝘁 𝗳𝗲𝗮𝘁𝘂𝗿𝗲 𝗮𝗹𝗲𝗿𝘁: you can now share agents to the Hub! 🥳🥳

And any agent pushed to Hub get a cool Space interface to directly chat with it.

This was a real technical challenge: for instance, serializing tools to export them meant that you needed to get all the source code for a tool, verify that it was standalone (not relying on external variables), and gathering all the packages required to make it run.

Go try it out! 👉 https://github.com/huggingface/smolagents