Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
MoritzLaurer
's Collections
Zeroshot Classifiers
leaderboards
synthetic data
Code Generation
other-useful
other-interesting
leaderboards
updated
Nov 19
Upvote
5
Running
on
CPU Upgrade
59
π
LeaderboardExplorer
Running
3.79k
ππ€
Chatbot Arena Leaderboard
Running
on
CPU Upgrade
12k
π
Open LLM Leaderboard
Track, rank and evaluate open LLMs and chatbots
Running
on
CPU Upgrade
4.4k
π₯
MTEB Leaderboard
Running
on
CPU Upgrade
571
π
Open ASR Leaderboard
Running
391
πποΈ
LLM-Perf Leaderboard
Running
1.03k
π
Big Code Models Leaderboard
Runtime error
78
π©ββοΈπ€π€
Human & GPT-4 Evaluation of LLMs Leaderboard
Running
425
π
Can Ai Code Results
Running
on
CPU Upgrade
126
π₯
Hallucinations Leaderboard
Running
on
CPU Upgrade
105
π₯
Enterprise Scenarios Leaderboard
Running
on
CPU Upgrade
85
π₯
LLM Safety Leaderboard
Running
532
πΌπ¬
Vision Arena (Testing VLMs side-by-side)
Running
57
π
CyberSecEvalTest
Running
40
π»
Redteaming Resistance Leaderboard
Running
51
π¦Ύπ€
Arena Hard
Running
266
π¨
LLM Performance Leaderboard
Running
on
CPU Upgrade
63
π₯
AIR-Bench Leaderboard
Running
on
CPU Upgrade
541
π
Open VLM Leaderboard
VLMEvalKit Evaluation Results Collection
Running
298
π
Reward Bench Leaderboard
Running
148
π₯
BigCodeBench Leaderboard
Running
10
π₯
MJ Bench Leaderboard
Running
83
βοΈ
MTEB Arena
Runtime error
146
π¬
Open LLM Progress Tracker
Running
83
π»
Judge Arena
Upvote
5
+1
Share collection
View history
Collection guide
Browse collections