open_llm_leaderboard / src /leaderboard

Commit History

debugging the codebase
1489ff1

Alina Lozovskaia commited on

ported new app.py [wip]
a03f0fa

Alina Lozovskaia commited on

dummy column refactoring (#688)
b7d036c
verified

alozowski HF staff commited on

search-update (#662)
0a3530a
verified

alozowski HF staff commited on

gating issues fix
c81dadf

Clémentine commited on

commit
df0b79f

Nathan Habib commited on

add flag on contaminated model
d1e81be
verified

clefourrier HF staff commited on

fixing display bug when importing missing tags
26bbde7

Clémentine commited on

fix display
4ccfada

Clémentine commited on

fix updater for adapter weights/merges + add some flags
ea04e0b

Clémentine commited on

added filters of #540
f6aad8d

Clémentine commited on

change model types available at submission time
05bda40

Clémentine commited on

flag models
4b67a33

Nathan Habib commited on

Update src/leaderboard/filter_models.py
2014251
verified

clefourrier HF staff commited on

Update src/leaderboard/filter_models.py
f6d5857
verified

clefourrier HF staff commited on

update flag for moe
80f473c

Clémentine commited on

Update src/leaderboard/filter_models.py
78e2c07
verified

clefourrier HF staff commited on

Update src/leaderboard/filter_models.py
2227d54
verified

clefourrier HF staff commited on

Removed flag from models with correct metadata
530580c
verified

clefourrier HF staff commited on

Added check to hide non FINISHED models
d9f882d

Clémentine commited on

add new flags
6c60c29

Clémentine commited on

simplified display, added an extra config repo to carry dynamic information
9b2e755

Clémentine commited on

wip
0c7ef71

Clémentine commited on

Update src/leaderboard/read_evals.py
3b554b5

clefourrier HF staff commited on

Incorrectly tagged merges are now flagged
90fa47e

Clémentine commited on

Added checkbox for merges
b762711

Clémentine commited on

flag model
991b9e1

Nathan Habib commited on

flag model
511d367

Nathan Habib commited on

adding merge check - super slow but at least info is displayed
20b060e

Clémentine commited on

flag models
c841f87

Nathan Habib commited on

flag models
425be57

Nathan Habib commited on

flag models
d93b3d2

Nathan Habib commited on

flag models
42f5749

Nathan Habib commited on

flag models
71834c1

Nathan Habib commited on

flag models
c1d0f7f

Nathan Habib commited on

nathan-flagged-models-vis (#478)
460ecf2

clefourrier HF staff commited on

added flag
783ccc5

Clémentine commited on

added tigerbot models to do not submit per authors request
202d26e

Clémentine commited on

flagging tiger models
e629df0

Nathan Habib commited on

simplified some parts of the code + updated requirements
9d22eee

Clémentine commited on

add model architecture as column
3dfaf22

Clémentine commited on

Refactor 2 - added plotting back
b1a1395

Clémentine commited on

Fix requirements for mistral models - to change once transformers gets updated.
002172c

Clémentine commited on

fix col width
fc1e99b

Clémentine commited on

refacto style + rate limit
df66f6e

Clémentine commited on

Fix TruthfulQA NaN scores to 0
bb17be3

Clémentine commited on

refacto part 1
2a5f9fb

Clémentine commited on