Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Spaces:
Duplicated from
nan/leaderboard
AIR-Bench
/
leaderboard
like
58
Running
on
CPU Upgrade
App
Files
Files
Community
28
a0387d8
leaderboard
/
src
/
benchmarks.py
Commit History
fix a bug in METRIC_LIST
443f557
hanhainebula
commited on
May 22
disable law-zh
8b7258f
verified
hanhainebula
commited on
May 20
Fix bug in dataset_dict: "gpt-3" -> "gpt3"
8102fce
verified
hanhainebula
commited on
May 19
Fix bug in dataset_dict: "health" -> "healthcare"
4a44211
verified
hanhainebula
commited on
May 19
Add msmarco for qa task
43fbed5
verified
hanhainebula
commited on
May 14
feat: improve the layout
32ebf18
nan
commited on
May 12
feat: adapt to the latest data format
1a2dba5
nan
commited on
May 11
chore: clean up
a96f80a
nan
commited on
May 10
feat: fix the table updating
f30cbcc
nan
commited on
May 10
feat: adapt UI in app.py
e8879cc
nan
commited on
May 9
feat: adapt the utils in app.py
9c49811
nan
commited on
May 9
feat: seperate the qa and longdoc tasks
9134169
nan
commited on
May 9
feat: adapt the data loading part
8b7a945
nan
commited on
May 9