nicolo

nicolollo

Recent Activity

reacted to lewtun's post with 👍 3 days ago

We outperform Llama 70B with Llama 3B on hard math by scaling test-time compute 🔥

How? By combining step-wise reward models with tree search algorithms :)

We show that smol models can match or exceed the performance of their much larger siblings when given enough "time to think".

We're open sourcing the full recipe and sharing a detailed blog post. In our blog post we cover:

📈 Compute-optimal scaling: How we implemented DeepMind's recipe to boost the mathematical capabilities of open models at test-time.

🎄 Diverse Verifier Tree Search (DVTS): An unpublished extension we developed to the verifier-guided tree search technique. This simple yet effective method improves diversity and delivers better performance, particularly at large test-time compute budgets.

🧭 Search and Learn: A lightweight toolkit for implementing search strategies with LLMs, built for speed with vLLM.

Here are the links:
- Blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute
- Code: https://github.com/huggingface/search-and-learn

Enjoy!

reacted to burtenshaw's post with ❤️ 10 days ago

Quick update from week 1 of smol course. The community is taking the driving seat and using the material for their own projects. If you want to do the same, join in!

- We have ongoing translation projects in Korean, Vietnamese, Portuguese, and Spanish.
- 3 chapters are ready for students, on topics like instruction tuning, preference alignment, and parameter-efficient fine-tuning.
- 3 chapters are in progress, on evaluation, vision language models, and synthetic data.
- Around 780 people have forked the repo to use it for learning, teaching, and sharing.

⏭️ The next step is to support people who want to use the course for teaching, content creation, internal knowledge sharing, or anything else. If you're into this, drop an issue or PR.

Repo: https://buff.ly/3ZCMKX2
Discord channel: https://buff.ly/4f9F8jA

liked a dataset 10 days ago

Xkev/LLaVA-CoT-100k

nicolollo's activity

liked 2 datasets 10 days ago

Xkev/LLaVA-CoT-100k

Viewer • Updated 23 days ago • 98.6k • 2.17k • 55

5CD-AI/LLaVA-CoT-o1-Instruct

Viewer • Updated 23 days ago • 58.5k • 528 • 59

liked a model 16 days ago

AdaptLLM/Adapt-MLLM-to-Domains

Updated 6 days ago • 9

liked a Space 25 days ago

Running

💻

Judge Arena

liked 2 datasets about 1 month ago

mlabonne/orca-agentinstruct-1M-v1-cleaned

Viewer • Updated Nov 18 • 1.05M • 2.26k • 52

microsoft/orca-agentinstruct-1M-v1

Viewer • Updated Nov 1 • 1.05M • 14k • 404

liked a model about 1 month ago

HV-Khurdula/Dua-Vision-Base

Image-Text-to-Text • Updated Oct 29 • 33 • 3

liked a model about 2 months ago

neulab/Pangea-7B

Updated Oct 24 • 6.28k • 122

liked 4 datasets about 2 months ago

liked 2 models about 2 months ago

huihui-ai/Qwen2-VL-2B-Instruct-abliterated

Image-Text-to-Text • Updated Nov 19 • 405 • 5

Qwen/Qwen2-VL-2B-Instruct

Image-Text-to-Text • Updated 15 days ago • 973k • 324

liked a dataset about 2 months ago

wangclnlp/vision-feedback-mix-binarized-cleaned

Viewer • Updated Jul 21 • 98.3k • 61 • 7

liked 2 models 2 months ago

Zyphra/Zamba2-7B-Instruct

Text Generation • Updated Oct 18 • 749 • 83

deepseek-ai/Janus-1.3B

Any-to-Any • Updated Nov 14 • 6.76k • 479

liked 2 models 3 months ago

Qwen/Qwen2.5-7B-Instruct

Text Generation • Updated Sep 25 • 2.04M • 364

Qwen/Qwen2.5-72B-Instruct

Text Generation • Updated Sep 25 • 270k • 615

liked a Space 3 months ago

Running

296

🧬

Synthetic Data Generator

Build datasets using natural language