Spaces: AIR-Bench/leaderboard (duplicated from nan/leaderboard)
leaderboard/src
3 contributors. History: 54 commits
Latest commit: ca2a141 (verified) by hanhainebula, 4 months ago: Modify the commands of evaluating
Name           Size       Last commit message                         Updated
display        -          fix: fix the bug in duplicated columns      4 months ago
about.py       4.42 kB    Modify the commands of evaluating           4 months ago
benchmarks.py  4.41 kB    Add msmarco for qa task                     4 months ago
envs.py        754 Bytes  chore: clean up                             4 months ago
read_evals.py  8.39 kB    Fix check when loading results file         4 months ago
utils.py       9.54 kB    fix: fix the bug in the annoymous checkbox  4 months ago