Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 3 items • Updated 6 days ago • 300
ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario Paper • 2501.10132 • Published 16 days ago • 17
Video Depth Anything: Consistent Depth Estimation for Super-Long Videos Paper • 2501.12375 • Published 11 days ago • 22
UI-TARS: Pioneering Automated GUI Interaction with Native Agents Paper • 2501.12326 • Published 11 days ago • 47
view article Article Introducing smolagents: simple agents that write actions in code. Dec 31, 2024 • 546
view article Article Train 400x faster Static Embedding Models with Sentence Transformers 18 days ago • 130
view article Article Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging By akjindal53244 • Aug 19, 2024 • 76