Victor Mustar PRO

victor

·

victormustar

AI & ML interests

Building the UX of this website

Recent Activity

liked a dataset 25 minutes ago

smirki/UI_Reasoning_Dataset

liked a Space about 2 hours ago

lllyasviel/LuminaBrush

liked a model about 2 hours ago

NousResearch/DeepHermes-3-Llama-3-8B-Preview

victor's activity

upvoted a paper about 4 hours ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published 4 days ago • 166

upvoted a paper 1 day ago

LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!

Paper • 2502.07374 • Published 6 days ago • 30

upvoted an article 1 day ago

Article

Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset

7 days ago • 31

upvoted 3 articles 3 days ago

Article

Welcome Fireworks.ai on the Hub 🎆

4 days ago • 18

Article

From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages

6 days ago • 21

Article

Topic 27: What are Chain-of-Agents and Chain-of-RAG?

4 days ago • 8

upvoted an article 4 days ago

Article

From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub

6 days ago • 42

upvoted 2 papers 5 days ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published 7 days ago • 125

Expect the Unexpected: FailSafe Long Context QA for Finance

Paper • 2502.06329 • Published 7 days ago • 118

upvoted an article 5 days ago

Article

Open R1: Update #2

7 days ago • 171

upvoted a paper 7 days ago

PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models

Paper • 2502.01584 • Published 14 days ago • 9

upvoted an article 11 days ago

Article

G2P Shrinks Speech Models

12 days ago • 23

upvoted a paper 11 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 13 days ago • 175

upvoted an article 11 days ago

Article

Mastering Long Contexts in LLMs with KVPress

25 days ago • 62

upvoted 4 papers 13 days ago

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

Paper • 2501.17703 • Published 19 days ago • 54

s1: Simple test-time scaling

Paper • 2501.19393 • Published 17 days ago • 102

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published 14 days ago • 176

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published 14 days ago • 112

upvoted an article 15 days ago

Article

Open-R1: Update #1

16 days ago • 280

upvoted a paper 17 days ago

GuardReasoner: Towards Reasoning-based LLM Safeguards

Paper • 2501.18492 • Published 18 days ago • 81