Luca Baggi

lucabaggi

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

tencent/HunyuanVideo

liked a model 5 days ago

Qwen/QwQ-32B

liked a model 6 days ago

SparkAudio/Spark-TTS-0.5B

lucabaggi's activity

upvoted a collection 7 days ago

C4AI Aya Vision

Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated 7 days ago • 62

upvoted 2 collections 12 days ago

ColPali Models

Pre-trained checkpoints for the ColPali model. • 8 items • Updated Jan 23 • 4

ColQwen2 Models

Pre-trained checkpoints for the ColQwen2 model. • 4 items • Updated Jan 23 • 4

upvoted a paper 12 days ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published 29 days ago • 142

upvoted a collection 13 days ago

Qwen2-Audio

Audio-language model series based on Qwen2 • 4 items • Updated Nov 28, 2024 • 53

upvoted a collection 2 months ago

ModernBERT

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 141

upvoted 2 collections 4 months ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated 29 days ago • 298

Domain-specific GLiNER

6 items • Updated Jun 17, 2024 • 6

upvoted a paper 4 months ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 114

upvoted a collection 4 months ago

🇮🇹 Italian NLP Resources

Collection of models, datasets and demos relevant to Italian NLP 🇮🇹 • 285 items • Updated 9 days ago • 24

upvoted 2 papers 5 months ago

Baichuan-Omni Technical Report

Paper • 2410.08565 • Published Oct 11, 2024 • 85

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2, 2024 • 51

upvoted 2 collections 6 months ago

Moirai-R models

10 items • Updated 22 days ago • 38

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18, 2024 • 227

upvoted an article 7 months ago

Training and Finetuning Embedding Models with Sentence Transformers v3

Article • May 28, 2024 • 193

upvoted a collection 8 months ago

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 651

upvoted 2 collections 9 months ago

Gemma 2 Release

15 items • Updated Dec 13, 2024 • 216

Qwen2

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Nov 28, 2024 • 359

upvoted a paper 10 months ago

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published May 2, 2024 • 121

upvoted a collection 10 months ago

Granite Code Models

A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 23 items • Updated 15 days ago • 184