The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding Paper • 2502.08946 • Published 4 days ago • 166
LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters! Paper • 2502.07374 • Published 6 days ago • 30
view article Article Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset By sdiazlor • 7 days ago • 31
view article Article From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages By Steveeeeeeen and 1 other • 6 days ago • 21
view article Article Topic 27: What are Chain-of-Agents and Chain-of-RAG? By Kseniase and 1 other • 4 days ago • 8
view article Article From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub 6 days ago • 42
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling Paper • 2502.06703 • Published 7 days ago • 125
Expect the Unexpected: FailSafe Long Context QA for Finance Paper • 2502.06329 • Published 7 days ago • 118
PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models Paper • 2502.01584 • Published 14 days ago • 9
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 13 days ago • 175
view article Article Mastering Long Contexts in LLMs with KVPress By nvidia and 1 other • 25 days ago • 62
Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate Paper • 2501.17703 • Published 19 days ago • 54
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models Paper • 2502.01061 • Published 14 days ago • 176
The Differences Between Direct Alignment Algorithms are a Blur Paper • 2502.01237 • Published 14 days ago • 112
GuardReasoner: Towards Reasoning-based LLM Safeguards Paper • 2501.18492 • Published 18 days ago • 81