3 4 11

Frederic Branchaud-Charron

Dref360

https://dref360.github.io/

Dref360

AI & ML interests

Bayesian deep learning, uncertainty estimation, and trustworthiness.

Recent Activity

upvoted a paper 5 days ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

upvoted a collection 6 days ago

InternVL2.5

reacted to lewtun's post with ❤️ 6 days ago

We outperform Llama 70B with Llama 3B on hard math by scaling test-time compute 🔥 How? By combining step-wise reward models with tree search algorithms :) We show that smol models can match or exceed the performance of their much larger siblings when given enough "time to think" We're open sourcing the full recipe and sharing a detailed blog post. In our blog post we cover: 📈 Compute-optimal scaling: How we implemented DeepMind's recipe to boost the mathematical capabilities of open models at test-time. 🎄 Diverse Verifier Tree Search (DVTS): An unpublished extension we developed to the verifier-guided tree search technique. This simple yet effective method improves diversity and delivers better performance, particularly at large test-time compute budgets. 🧭 Search and Learn: A lightweight toolkit for implementing search strategies with LLMs and built for speed with vLLM Here's the links: - Blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute - Code: https://github.com/huggingface/search-and-learn Enjoy!

View all activity

Organizations

Dref360's activity

upvoted a paper 5 days ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published 9 days ago • 130

upvoted a collection 6 days ago

InternVL2.5

Collection

Better than InternVL 2.0 • 18 items • Updated 1 day ago • 73

reacted to lewtun's post with ❤️🔥 6 days ago

Post

6337

We outperform Llama 70B with Llama 3B on hard math by scaling test-time compute 🔥

How? By combining step-wise reward models with tree search algorithms :)

We show that smol models can match or exceed the performance of their much larger siblings when given enough "time to think"

We're open sourcing the full recipe and sharing a detailed blog post.

In our blog post we cover:

📈 Compute-optimal scaling: How we implemented DeepMind's recipe to boost the mathematical capabilities of open models at test-time.

🎄 Diverse Verifier Tree Search (DVTS): An unpublished extension we developed to the verifier-guided tree search technique. This simple yet effective method improves diversity and delivers better performance, particularly at large test-time compute budgets.

🧭 Search and Learn: A lightweight toolkit for implementing search strategies with LLMs and built for speed with vLLM

Here's the links:

- Blog post: HuggingFaceH4/blogpost-scaling-test-time-compute

- Code: https://github.com/huggingface/search-and-learn

Enjoy!

2 replies

liked a Space 6 days ago

Running

331

📝

Scaling test-time compute

reacted to burtenshaw's post with ❤️ 19 days ago

Post

2523

For anyone looking to boost their LLM fine-tuning and alignment skills this decemeber. We're running this free and open course called smol course. It’s not big like Li Yin and @mlabonne , it’s just smol.

👷 It focuses on practical use cases, so if you’re working on something, bring it along.

👯‍♀️ It’s peer reviewed and open so you can discuss and get feedback.

🤘 If you’re already a smol pro, feel free to drop a star or issue.

> > Part 1 starts now, and it’s on instruction tuning!

https://github.com/huggingface/smol-course

reacted to andito's post with 🔥❤️ 25 days ago

Post

3223

Let's go! We are releasing SmolVLM, a smol 2B VLM built for on-device inference that outperforms all models at similar GPU RAM usage and tokens throughputs.

- SmolVLM generates tokens 7.5 to 16 times faster than Qwen2-VL! 🤯
- Other models at this size crash a laptop, but SmolVLM comfortably generates 17 tokens/sec on a macbook! 🚀
- SmolVLM can be fine-tuned on a Google collab! Or process millions of documents with a consumer GPU!
- SmolVLM even outperforms larger models in video benchmarks, despite not even being trained on videos!

Check out more!
Demo: HuggingFaceTB/SmolVLM
Blog: https://huggingface.co/blog/smolvlm
Model: HuggingFaceTB/SmolVLM-Instruct
Fine-tuning script: https://github.com/huggingface/smollm/blob/main/finetuning/Smol_VLM_FT.ipynb

reacted to merve's post with 🚀🔥 26 days ago

Post

3847

Small yet mighty! 💫

We are releasing SmolVLM: a new 2B small vision language made for on-device use, fine-tunable on consumer GPU, immensely memory efficient 🤠

We release three checkpoints under Apache 2.0: SmolVLM-Instruct, SmolVLM-Synthetic and SmolVLM-Base HuggingFaceTB/smolvlm-6740bd584b2dcbf51ecb1f39

Learn more from our blog here: huggingface.co/blog/smolvlm
This release comes with a demo, fine-tuning code, MLX integration and TRL integration for DPO 💝
Try the demo: HuggingFaceTB/SmolVLM
Fine-tuning Recipe: https://github.com/huggingface/smollm/blob/main/finetuning/Smol_VLM_FT.ipynb
Also TRL integration for DPO 💗

posted an update 27 days ago

Post

1271

New week, new #cv Gradio app for human understanding.( Dref360/human-interaction-demo) 🥳

This demo highlights when a person touches an object. For instance, it is useful to know if someone is touching a wall, a vase or a door. It works for multiple people too!

Still using nielsr/vitpose-base-simple for pose estimation, very excited to see the PR approved!

updated a Space 29 days ago

Running

📊

Human Interaction Demo

Uses pose estimation to determine what are you touching.

liked a model about 1 month ago

danelcsb/vitpose-base-simple

Keypoint Detection • Updated 29 days ago • 40 • 1

reacted to jsulz's post with 🧠❤️🔥 about 1 month ago

Post

2910

When the XetHub crew joined Hugging Face this fall, @erinys and I started brainstorming how to share our work to replace Git LFS on the Hub. Uploading and downloading large models and datasets takes precious time. That’s where our chunk-based approach comes in.

Instead of versioning files (like Git and Git LFS), we version variable-sized chunks of data. For the Hugging Face community, this means:

⏩ Only upload the chunks that changed.
🚀 Download just the updates, not the whole file.
🧠 We store your file as deduplicated chunks

In our benchmarks, we found that using CDC to store iterative model and dataset version led to transfer speedups of ~2x, but this isn’t just a performance boost. It’s a rethinking of how we manage models and datasets on the Hub.

We're planning on our new storage backend to the Hub in early 2025 - check out our blog to dive deeper, and let us know: how could this improve your workflows?

https://huggingface.co/blog/from-files-to-chunks

New activity in mhyatt000/YOLOv5 about 1 month ago

🚩 Report: Spam

#1 opened about 1 month ago by

Dref360

updated a Space about 1 month ago

Running

🏆

Vit Pose Playground

Small Space to test ViTPose

posted an update about 1 month ago

Post

2255

Sharing a new space to test out ViTPose, a pose estimation model using Visual Transformers.

[ViTPose Playground]( Dref360/vit_pose_playground)

This model will be available in transformers once [#30530](https://github.com/huggingface/transformers/pull/30530) is merged. Huge shoutout to @nielsr and @danelcsb for bringing this to HF!

Here's the result on my Ken Halloween costume.

2 replies

reacted to averoo's post with 👍 about 2 months ago

Post

3790

Hello, researchers! I've tried to made reading HF Daily Papers easier and made a tool that does reviews with LLMs like Claude 3.5, GPT-4o and sometimes FLUX.

📚 Classification by topics
📅 Sorting by publication date and HF addition date
🔄 Syncing every 2 hours
💻 Hosted on GitHub
🌏 English, Russian, and Chinese
📈 Top by week/month (in progress)

👉 https://hfday.ru

Let me know what do you think of it.