We applied the same data-driven approach that led to SOTA English performance in 🍷 FineWeb to thousands of languages.
🥂 FineWeb2 has 8 TB of compressed text data and outperforms other multilingual datasets in our experiments.
The dataset is released under the permissive ODC-By 1.0 license, and the 💻 code to reproduce it and our evaluations is public.
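If you want to poke at the data right away, here is a minimal sketch of streaming a single language subset with the 🤗 `datasets` library. It assumes the dataset is hosted as `HuggingFaceFW/fineweb-2` with per-language configs named by ISO 639-3 code plus script (e.g. `fra_Latn` for French); check the dataset card for the exact config names.

```python
from datasets import load_dataset

# Stream one language subset so the multi-terabyte corpus never hits disk.
# Assumed repo ID and config name; see the dataset card for the real list.
fw2_french = load_dataset(
    "HuggingFaceFW/fineweb-2",
    name="fra_Latn",
    split="train",
    streaming=True,
)

# Peek at a few documents without downloading the whole subset.
for i, doc in enumerate(fw2_french):
    print(doc["text"][:200])
    if i == 2:
        break
```

Streaming mode is the practical way to explore a dataset of this size: you iterate over records lazily instead of materializing the full split locally.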
We will announce a big community project very soon, and we are working on a 📝 blogpost walking you through the entire dataset creation process. Stay tuned!
Fascinating point from @thomwolf at Web Summit: AI misuse (deepfakes, fake news) is actually easier with closed models, not with open-source ones.
This challenges the common narrative that open-source AI is inherently more dangerous. The reality is more nuanced: although open-source models may seem technically easier to misuse, the accessibility and product-focused design of closed models appear to be driving more actual harm.
Important context for current AI safety discussions and regulation debates.