Stefano Antonioni

diddl1970

AI & ML interests

NLP, LLM

Recent Activity

liked a model 9 days ago

unsloth/DeepSeek-R1-GGUF

upvoted a collection 10 days ago

Organizations

diddl1970's activity

upvoted 3 collections 10 days ago

DeepSeek-VL2

5 items • Updated 7 days ago • 67

DeepSeek-V3

3 items • Updated Jan 6 • 183

DeepSeek-R1

8 items • Updated 27 days ago • 507

upvoted an article 12 days ago

Article

Welcome to Inference Providers on the Hub 🔥

20 days ago

• 362

upvoted a collection 13 days ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 11 items • Updated 6 days ago • 65

upvoted 2 articles 13 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

20 days ago

• 748

Article

Open-R1: Update #1

15 days ago

• 280

upvoted 12 collections 17 days ago

Qwen2-VL

Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 205

Qwen2-Audio

Audio-language model series based on Qwen2 • 4 items • Updated Nov 28, 2024 • 51

Qwen2-Math

Math-specific model series based on Qwen2 • 8 items • Updated Nov 28, 2024 • 50

Qwen2

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Nov 28, 2024 • 357

Qwen2.5-Math

Math-specific model series based on Qwen2.5 • 11 items • Updated Jan 14 • 73

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Nov 28, 2024 • 520

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 40 items • Updated Nov 28, 2024 • 282

QwQ

Qwen with Questions • 2 items • Updated Nov 28, 2024 • 57

QVQ

QVQ: Qwen models for visual reasoning • 7 items • Updated Jan 1 • 43

Qwen2.5-1M

The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 21 days ago • 99

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 3 items • Updated 21 days ago • 342

Llama 3.3

This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated Dec 6, 2024 • 133

upvoted an article about 1 month ago

Article

Use Models from the Hugging Face Hub in LM Studio

Nov 28, 2024

• 139