Thomas Wolf's picture

Thomas Wolf PRO

thomwolf

·

https://thomwolf.io

AI & ML interests

NLP and open-source :-)

Recent Activity

liked a model about 14 hours ago

open-r1/OlympicCoder-7B

liked a dataset 1 day ago

facebook/natural_reasoning

posted an update 1 day ago

We've kept pushing our Open-R1 project, an open initiative to replicate and extend the techniques behind DeepSeek-R1. And even we were mind-blown by the results we got with this latest model we're releasing: ⚡️OlympicCoder (https://huggingface.co/open-r1/OlympicCoder-7B and https://huggingface.co/open-r1/OlympicCoder-32B) It's beating Claude 3.7 on (competitive) programming –a domain Anthropic has been historically really strong at– and it's getting close to o1-mini/R1 on olympiad level coding with just 7B parameters! And the best part is that we're open-sourcing all about its training dataset, the new IOI benchmark, and more in our Open-R1 progress report #3: https://huggingface.co/blog/open-r1/update-3 Datasets are are releasing: - https://huggingface.co/datasets/open-r1/codeforces - https://huggingface.co/datasets/open-r1/codeforces-cots - https://huggingface.co/datasets/open-r1/ioi - https://huggingface.co/datasets/open-r1/ioi-test-cases - https://huggingface.co/datasets/open-r1/ioi-sample-solutions - https://huggingface.co/datasets/open-r1/ioi-cots - https://huggingface.co/datasets/open-r1/ioi-2024-model-solutions

View all activity

Organizations

thomwolf's activity

upvoted an article 2 days ago

Article

Open R1: Update #3

By

and 9 others •

2 days ago

• 197

upvoted an article 17 days ago

Article

FastRTC: The Real-Time Communication Library for Python

17 days ago

• 143

upvoted a paper 17 days ago

Fully Autonomous AI Agents Should Not be Developed

Paper • 2502.02649 • Published Feb 4 • 27

upvoted a paper 22 days ago

Presumed Cultural Identity: How Names Shape LLM Responses

Paper • 2502.11995 • Published 25 days ago • 10

upvoted 2 articles 24 days ago

Article

1 Billion Classifications

29 days ago

• 42

Article

Fixing Open LLM Leaderboard with Math-Verify

28 days ago

• 27

upvoted an article 29 days ago

Article

DABStep: Data Agent Benchmark for Multi-step Reasoning

Feb 4

• 63

upvoted 2 papers about 1 month ago

Demystifying Long Chain-of-Thought Reasoning in LLMs

Paper • 2502.03373 • Published Feb 5 • 57

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 203

upvoted 3 articles about 1 month ago

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

Feb 4

• 112

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.16k

Article

Open-R1: Update #1

By

and 7 others •

Feb 2

• 295

upvoted a collection about 1 month ago

Tulu 3 Models

All models released with Tulu 3 -- state of the art open post-training recipes. • 11 items • Updated about 14 hours ago • 93

upvoted 2 articles about 1 month ago

Article

Welcome to Inference Providers on the Hub 🔥

Jan 28

• 429

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 803

upvoted 2 articles about 2 months ago

Article

MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era

By

•

Jan 15

• 43

Article

Diving into MiniMax01 405B MoE

By

•

Jan 15

• 17

upvoted a paper 2 months ago

DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning

Paper • 2406.11896 • Published Jun 14, 2024 • 20

upvoted an article 2 months ago

Article

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

By

•

Jan 2

• 40

upvoted a collection 2 months ago

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated Jan 8 • 565