Lewis Tunstall PRO

lewtun

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

updated a collection about 3 hours ago

🧠 Reasoning datasets

liked a dataset about 3 hours ago

bethgelab/CuratedThoughts

liked a Space about 4 hours ago

nanotron/ultrascale-playbook

lewtun's activity

upvoted a collection 6 days ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 12 items • Updated about 3 hours ago • 74

upvoted a paper 8 days ago

Competitive Programming with Large Reasoning Models

Paper • 2502.06807 • Published 17 days ago • 59

upvoted a collection 9 days ago

OpenR1-Math

Dataset and SFT model distilled from DeepSeek-R1. Check out our blog post for more details: https://huggingface.co/blog/open-r1/update-2 • 3 items • Updated 6 days ago • 6

upvoted an article 10 days ago

Open R1: Update #2

Article • 7 authors • 10 days ago • 179

upvoted a paper 14 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 16 days ago • 186

upvoted an article 14 days ago

Smol but Mighty: Can Small Models Reason well? 🤔

Article • 16 days ago • 7

upvoted an article 18 days ago

Open-R1: Update #1

Article • 8 authors • 18 days ago • 284

upvoted an article 20 days ago

Replicating DeepSeek R1 for Information Extraction

Article • 20 days ago • 34

upvoted an article 23 days ago

Welcome to Inference Providers on the Hub 🔥

Article • 24 days ago • 378

upvoted an article 24 days ago

Open-R1: a fully open reproduction of DeepSeek-R1

Article • 24 days ago • 767

upvoted 3 articles about 1 month ago

Gradio spaces are the perfect agent tools!

Article • Jan 17 • 14

Introducing smolagents: simple agents that write actions in code.

Article • Dec 31, 2024 • 722

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

Article • Jan 16 • 68

upvoted a paper about 1 month ago

A Simple and Provable Scaling Law for the Test-Time Compute of Large Language Models

Paper • 2411.19477 • Published Nov 29, 2024 • 6

upvoted 5 papers about 2 months ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 80

Evaluating Large Language Models Trained on Code

Paper • 2107.03374 • Published Jul 7, 2021 • 8

Diverse Beam Search: Decoding Diverse Solutions from Neural Sequence Models

Paper • 1610.02424 • Published Oct 7, 2016 • 1

Scaling Laws for Neural Language Models

Paper • 2001.08361 • Published Jan 23, 2020 • 7

DeepSeek-V3 Technical Report

Paper • 2412.19437 • Published Dec 27, 2024 • 51

upvoted a paper 2 months ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 134