Loubna Ben Allal

loubnabnl

AI & ML interests

LLMs, ML for code, Synthetic data

loubnabnl's activity

updated a Space about 21 hours ago
posted an update about 21 hours ago
Making SmolLM2 reproducible: open-sourcing our training & evaluation toolkit 🛠️ https://github.com/huggingface/smollm/

- Pre-training code with nanotron
- Evaluation suite with lighteval
- Synthetic data generation using distilabel (powers our new SFT dataset HuggingFaceTB/smoltalk)
- Post-training scripts with TRL & the alignment handbook
- On-device tools with llama.cpp for summarization, rewriting & agents

Apache 2.0 licensed. V2 pre-training data mix coming soon!

Which other tools should we add next?
Reacted to prithivMLmods's post with 🔥 1 day ago
Weekend Dribble 📦🍺

Adapters for Product Ad Backdrops, Smooth Polaroids, Minimalist Sketch cards, Super Blends!!

🤏Demo on: prithivMLmods/FLUX-LoRA-DLC

Stranger Zones :
👉🏼{ Super Blend } : strangerzonehf/Flux-Super-Blend-LoRA

👉🏼{ Product Concept Ad } : prithivMLmods/Flux-Product-Ad-Backdrop
👉🏼{ Frosted Mock-ups } : prithivMLmods/Flux.1-Dev-Frosted-Container-LoRA
👉🏼{ Polaroid Plus } : prithivMLmods/Flux-Polaroid-Plus
👉🏼{ Sketch Cards } : prithivMLmods/Flux.1-Dev-Sketch-Card-LoRA

👉Stranger Zone: https://huggingface.co/strangerzonehf

👉Flux LoRA Collections: prithivMLmods/flux-lora-collections-66dd5908be2206cfaa8519be

.
.
.
@prithivMLmods 🤗
Reacted to merve's post with ❤️🚀 1 day ago
your hugging face profile now has your recent activities 🤗
New activity in HuggingFaceTB/SmolLM2-135M-Instruct 1 day ago

add base model metadata

1
#4 opened 11 days ago by davanstrien
New activity in HuggingFaceTB/SmolLM2-360M-Instruct 3 days ago

finetuning

4
#2 opened 25 days ago by HassanStar
Reacted to merve's post with 🔥 3 days ago
What a week! A recap of everything you missed ❄️
merve/nov-22-releases-673fbbcfc1c97c4f411def07
Multimodal ✨
> Mistral AI released Pixtral 124B, a gigantic open vision language model
> Llava-CoT (formerly known as Llava-o1) was released, a multimodal reproduction of the o1 model by PKU
> OpenGVLab released MMPR: a new multimodal reasoning dataset
> Jina has released Jina-CLIP-v2 0.98B multilingual multimodal embeddings
> Apple released new SotA vision encoders AIMv2

LLMs 🦙
> AllenAI dropped a huge release of models, datasets and scripts for Tülu, a family of models based on Llama 3.1 aligned with SFT, DPO and a new technique they have developed called RLVR
> Jina has released embeddings-v3: new multilingual embeddings with longer context
> Hugging Face released SmolTalk: synthetic dataset used to align SmolLM2 using supervised fine-tuning
> Microsoft released orca-agentinstruct-1M-v1: a gigantic instruction dataset of 1M synthetic instruction pairs

Image Generation 🖼️
> Black Forest Labs released FLUX.1 Tools: four new models for different image modifications and two LoRAs to do image conditioning and better steer generations

Lastly, Hugging Face released a new library, Observers: a lightweight SDK for monitoring interactions with AI APIs and easily storing and browsing them 📚
$ pip install observers