Running on CPU Upgrade 184 184 MMLU-Pro Leaderboard 🥇 More advanced and challenging multi-task evaluation
Running 222 222 AI2 WildBench Leaderboard (V2) 🦁 Display and explore model leaderboards and chat history
Running on CPU Upgrade 4.77k 4.77k MTEB Leaderboard 🥇 Select and filter benchmarks for text embedding tasks