nicolo
nicolollo
1 follower · 7 following
AI & ML interests
None yet
Recent Activity
reacted to lewtun's post 3 days ago
We outperform Llama 70B with Llama 3B on hard math by scaling test-time compute. How? By combining step-wise reward models with tree search algorithms :)

We show that smol models can match or exceed the performance of their much larger siblings when given enough "time to think".

We're open sourcing the full recipe and sharing a detailed blog post. In our blog post we cover:
- Compute-optimal scaling: How we implemented DeepMind's recipe to boost the mathematical capabilities of open models at test-time.
- Diverse Verifier Tree Search (DVTS): An unpublished extension we developed to the verifier-guided tree search technique. This simple yet effective method improves diversity and delivers better performance, particularly at large test-time compute budgets.
- Search and Learn: A lightweight toolkit for implementing search strategies with LLMs, built for speed with vLLM.

Here are the links:
- Blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute
- Code: https://github.com/huggingface/search-and-learn

Enjoy!
reacted to burtenshaw's post with ❤️ 10 days ago
Quick update from week 1 of smol course. The community is taking the driving seat and using the material for their own projects. If you want to do the same, join in!
- We have ongoing translation projects in Korean, Vietnamese, Portuguese, and Spanish.
- 3 chapters are ready for students, on topics like instruction tuning, preference alignment, and parameter-efficient fine-tuning.
- 3 chapters are in progress on evaluation, vision language models, and synthetic data.
- Around 780 people have forked the repo to use it for learning, teaching, and sharing.

Next step is to support people that want to use the course for teaching, content creation, internal knowledge sharing, or anything. If you're into this, drop an issue or PR.

Repo: https://buff.ly/3ZCMKX2
Discord channel: https://buff.ly/4f9F8jA
liked a dataset 10 days ago
Xkev/LLaVA-CoT-100k
Organizations
nicolollo's activity
liked 2 datasets 10 days ago
Xkev/LLaVA-CoT-100k • Viewer • Updated 23 days ago • 98.6k • 2.17k • 55
5CD-AI/LLaVA-CoT-o1-Instruct • Viewer • Updated 23 days ago • 58.5k • 528 • 59
liked a model 16 days ago
AdaptLLM/Adapt-MLLM-to-Domains • Updated 6 days ago • 9
liked a Space 25 days ago
Judge Arena • Running • 83
liked 2 datasets about 1 month ago
mlabonne/orca-agentinstruct-1M-v1-cleaned • Viewer • Updated Nov 18 • 1.05M • 2.26k • 52
microsoft/orca-agentinstruct-1M-v1 • Viewer • Updated Nov 1 • 1.05M • 14k • 404
liked a model about 1 month ago
HV-Khurdula/Dua-Vision-Base • Image-Text-to-Text • Updated Oct 29 • 33 • 3
liked a model about 2 months ago
neulab/Pangea-7B • Updated Oct 24 • 6.28k • 122
liked 4 datasets about 2 months ago
MohamedRashad/easy_imageinwords • Viewer • Updated May 13 • 2.4k • 40 • 3
kadirnar/fluxdev_controlnet_16k • Viewer • Updated Aug 16 • 16.1k • 131 • 25
google/docci • Updated Jul 24 • 399 • 62
google/imageinwords • Updated May 25 • 162 • 116
liked 2 models about 2 months ago
huihui-ai/Qwen2-VL-2B-Instruct-abliterated • Image-Text-to-Text • Updated Nov 19 • 405 • 5
Qwen/Qwen2-VL-2B-Instruct • Image-Text-to-Text • Updated 15 days ago • 973k • 324
liked a dataset about 2 months ago
wangclnlp/vision-feedback-mix-binarized-cleaned • Viewer • Updated Jul 21 • 98.3k • 61 • 7
liked 2 models 2 months ago
Zyphra/Zamba2-7B-Instruct • Text Generation • Updated Oct 18 • 749 • 83
deepseek-ai/Janus-1.3B • Any-to-Any • Updated Nov 14 • 6.76k • 479
liked 2 models 3 months ago
Qwen/Qwen2.5-7B-Instruct • Text Generation • Updated Sep 25 • 2.04M • 364
Qwen/Qwen2.5-72B-Instruct • Text Generation • Updated Sep 25 • 270k • 615
liked a Space 3 months ago
Synthetic Data Generator • Running • 296
Build datasets using natural language