Reasoning Datasets Collection Distilled synthetic Reasoning datasets • 7 items • Updated 16 days ago • 51
Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 23 days ago • 99
Running on CPU Upgrade 4.8k 4.8k MTEB Leaderboard 🥇 Select benchmarks and languages for text embeddings evaluation
Leaderboards and benchmarks ✨ Collection Cool leaderboard spaces collection for models across modalities! Text, vision, audio, ... • 90 items • Updated 13 days ago • 96
Running on CPU Upgrade 62 62 LeaderboardExplorer 🔎 Filter and display leaderboards based on selected criteria
Running on CPU Upgrade 85 85 Open LLM Leaderboard Model Comparator 🏆 Compare Open LLM Leaderboard results