EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents Paper • 2502.09560 • Published 7 days ago • 31
The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding Paper • 2502.08946 • Published 7 days ago • 179
InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU Paper • 2502.08910 • Published 7 days ago • 136
view article Article From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub 8 days ago • 47
High-Fidelity Simultaneous Speech-To-Speech Translation Paper • 2502.03382 • Published 15 days ago • 8
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 15 days ago • 186
VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models Paper • 2502.02492 • Published 16 days ago • 55
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 137
view article Article Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial By open-r1 • 20 days ago • 35
Exploring the sustainable scaling of AI dilemma: A projective study of corporations' AI environmental impacts Paper • 2501.14334 • Published 27 days ago • 19
view article Article Mastering Long Contexts in LLMs with KVPress By nvidia and 1 other • 28 days ago • 62
view article Article Hugging Face and FriendliAI partner to supercharge model deployment on the Hub 29 days ago • 35
view article Article Yay! Organizations can now publish blog Articles By huggingface and 3 others • about 1 month ago • 34