Hugo Laurençon's picture

Hugo Laurençon

HugoLaurencon

·

HugoLaurencon

AI & ML interests

None yet

Recent Activity

upvoted an article 3 days ago

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

upvoted a paper 15 days ago

Autonomy-of-Experts Models

upvoted a paper 21 days ago

Learnings from Scaling Visual Tokenizers for Reconstruction and Generation

View all activity

Organizations

HugoLaurencon's activity

New activity in HuggingFaceM4/idefics2-8b about 1 month ago

Seems like the user prompt is ignored

#80 opened about 2 months ago by

New activity in OS-Copilot/OS-Genesis-7B-AC about 1 month ago

Permission error to access data

#1 opened about 1 month ago by

commented a paper about 2 months ago

CompCap: Improving Multimodal Large Language Models with Composite Captions

Paper • 2412.05243 • Published Dec 6, 2024 • 18 •

New activity in HuggingFaceM4/idefics2-8b 4 months ago

Can we use idefics2-8b for document classification(Multi Page Document Boundary Classification)?

#77 opened 4 months ago by

The inconsistency between evaluation and training.

#76 opened 5 months ago by

New activity in HuggingFaceM4/idefics2-8b 5 months ago

Format modelcard better

#75 opened 5 months ago by

New activity in HuggingFaceM4/Idefics3-8B-Llama3 5 months ago

How to Effectively Run the Idefics 3 Model on AWS SageMaker for Inference

#15 opened 5 months ago by

New activity in HuggingFaceM4/siglip-so400m-14-980-flash-attn2-navit 5 months ago

Performance decrease compared to base Siglip model without Navit.

#7 opened 5 months ago by

commented a paper 5 months ago

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22, 2024 • 125 •

New activity in HuggingFaceM4/Idefics3-8B-Llama3 5 months ago

How to use history prompts on the same image?

#12 opened 6 months ago by

New activity in HuggingFaceM4/idefics2-8b-base 5 months ago

Some issues regarding training

#9 opened 5 months ago by

New activity in HuggingFaceM4/Idefics3-8B-Llama3 5 months ago

Releasing base model and combined SFT dataset

#13 opened 5 months ago by

commented a paper 6 months ago

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22, 2024 • 125 •

New activity in HuggingFaceM4/Idefics3-8B-Llama3 6 months ago

Image encoding / rescaling Question

#11 opened 6 months ago by

Fine tuning fails

#10 opened 6 months ago by

maorcatmyheritage

Update README.md

#9 opened 6 months ago by

New activity in HuggingFaceM4/idefics3 6 months ago

This is amazing!

#2 opened 6 months ago by

New activity in HuggingFaceM4/Idefics3-8B-Llama3 6 months ago

pretraining datasets

#8 opened 6 months ago by

New activity in HuggingFaceM4/idefics2-8b-base 6 months ago

Initializing SIGLIP vision model in Idefics2

#8 opened 6 months ago by

Tree

#7 opened 6 months ago by