Loubna Ben Allal's picture

Loubna Ben Allal

loubnabnl

·

https://loubnabnl.github.io/

AI & ML interests

SmolLMs, ML for code, data

Recent Activity

reacted to lysandre's post with ❤️ 1 day ago

SmolVLM-2 and SigLIP-2 are now part of `transformers` in dedicated releases! They're added on top of the v4.49.0 release, and can be installed from the following tags: `v4.49.0-SmolVLM-2` and `v4.49.0-SigLIP-2`. This marks a new beginning for the release process of transformers. For the past five years, we've been doing monthly releases featuring many models (v4.49.0, the latest release, features 9 new architectures). Starting with SmolVLM-2 & SigLIP2, we'll now additionally release tags supporting new models on a stable branch. These models are therefore directly available for use by installing from the tag itself. These tags will continue to be updated with fixes applied to these models. Going forward, continue expecting software releases following semantic versioning: v4.50.0 will have ~10 new architectures compared to v4.49.0, as well as a myriad of new features, improvements and bug fixes. Accompanying these software releases, we'll release tags offering brand new models as fast as possible, to make them accessible to all immediately.

liked a Space 3 days ago

m-ric/open_Deep-Research

published a model 3 days ago

HuggingFaceTB/SmolLM2-1.7B-Instruct-16k

View all activity

Organizations

loubnabnl's activity

upvoted an article 4 days ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

5 days ago

• 146

upvoted a paper 8 days ago

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Paper • 2502.07316 • Published 13 days ago • 44

upvoted an article 14 days ago

Article

Open R1: Update #2

By

and 6 others •

14 days ago

• 184

upvoted a paper 18 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 20 days ago • 190

upvoted an article 28 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

28 days ago

• 771

upvoted an article about 1 month ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Jan 23

• 142

upvoted a paper about 1 month ago

StarCoder: may the source be with you!

Paper • 2305.06161 • Published May 9, 2023 • 31

upvoted an article about 1 month ago

Article

Diving into MiniMax01 405B MoE

By

•

Jan 15

• 17

upvoted an article about 2 months ago

Article

Fine-tune a SmolLM on domain-specific synthetic data from a LLM

By

•

Jan 3

• 35

upvoted a collection 3 months ago

SmolVLM

State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct • 5 items • Updated 4 days ago • 34

upvoted an article 6 months ago

Article

The 5 Most Under-Rated Tools on Hugging Face

Aug 22, 2024

• 86

upvoted a collection 6 months ago

💻 Local SmolLMs

SmolLM models in MLC, ONNX and GGUF format for local applications + in-browser demos • 14 items • Updated 4 days ago • 49

upvoted an article 7 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

• 321

upvoted a paper 8 months ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25, 2024 • 93

upvoted a paper 9 months ago

Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations

Paper • 2405.18392 • Published May 28, 2024 • 12

upvoted 2 collections 11 months ago

Leaderboards and benchmarks ✨

Cool leaderboard spaces collection for models across modalities! Text, vision, audio, ... • 90 items • Updated 19 days ago • 96

ZeroGPU Spaces

ZeroGPU Spaces made by the community • 17 items • Updated Jun 6, 2024 • 235

upvoted a paper 12 months ago

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29, 2024 • 138

upvoted a collection 12 months ago

💫 StarCoder2

StarCoder2 models and datasets! • 8 items • Updated Mar 1, 2024 • 83

upvoted a paper over 1 year ago

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 123