LiveBench / app.py

Commit History

Add GPT-4 & human eval tab
0227006

sheonhan committed on

Add search emoji
92ae76d

sheonhan committed on

Search on ENTER
48c5442

sheonhan committed on

Increase concurrency count
f458f0b

sheonhan committed on

import datetime correctly
d35aee2

sheonhan committed on

record submitted time
8696209

sheonhan committed on

style clean up
aa7c3f4

sheonhan committed on

implements search bar
ffefe11

sheonhan committed on

Add citation button
2a73469

sheonhan committed on

Simply layout
c131125

sheonhan committed on

clean up vars
4c8dd3c

sheonhan committed on

sync with the internal version
58733e4

sheonhan committed on

Auto-restart every hour
46f8d78

sheonhan committed on

start every 20 minutes
9567fa6

sheonhan committed on

use the same H4_TOKEN for restart
0a3d32f

sheonhan committed on

use BackgroundScheduler to restart space
10f9b3c

sheonhan committed on

rename block to demo
01233b7

sheonhan committed on

sort imports and import BackgroundScheduler
4596a70

sheonhan committed on

Tweak change log style
50a344f

sheonhan committed on

Add CHANGELOG
6ed68b6

sheonhan committed on

Add a baseline
35a0978

sheonhan committed on

Update app.py
d3fbe10

edbeeching HF staff committed on

copy edit
80ac3c2

sheonhan committed on

do not display incomplete models for now
1363c8a

sheonhan committed on

reject duplicate submission
f742519

sheonhan committed on

style update
a885f09

sheonhan committed on

display message after eval submission
85dbbc4

sheonhan committed on

Fix TruthQA typo
614ee1f

lewtun HF staff committed on

Automatically refresh the leaderboard when loading the space (#8)
3693dc6

edbeeching HF staff osanseviero committed on

refining for release
07bfeca

edbeeching committed on

updated text
a460f7a

edbeeching committed on

added public option
db6f218

edbeeching committed on

fixes mixup with 8bit eval and private
5cb1426

edbeeching committed on

adds base model for delta weight merging
a095268

edbeeching committed on

updates table to include revision
fcb01e3

edbeeching committed on

adds revision option
b2c063a

edbeeching committed on

refactoring leaderboard
f90ad24

edbeeching committed on

fixed a bug when loading results
a7919f0

edbeeching committed on

updates eval leaderboard so new evals can be added
1f60a20

edbeeching committed on

creates leaderboard
9346f1c

edbeeching committed on