Commit History

Add raw results links if exists, and fix minor issues
aa7060a

eduagarcia commited on

Merge branch 'main' of https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard into merge_original
811ded7

eduagarcia commited on

Add env variable SHOW_INCOMPLETE_EVALS and order evaluation queue by priority
8aaf0e7

eduagarcia commited on

Allow old model metrics
6269bd0

eduagarcia commited on

Add new tasks and make leadboard work without new tasks evals
5639a81

eduagarcia commited on

support hf leaderboard format and my format
a69553b

eduagarcia commited on

fix updater for adapter weights/merges + add some flags
ea04e0b

Clémentine commited on

added filters of #540
f6aad8d

Clémentine commited on

Add hidden option
b234783

eduagarcia commited on

Feature: FIELD with original HF Leaderboard ranking
71ecfbb

eduagarcia commited on

Merge Origin - Rename model types (#1)
9839977
verified

eduagarcia commited on

Evaluation time metric and plot
359d8a9

eduagarcia commited on

change model types available at submission time
05bda40

Clémentine commited on

flag models
4b67a33

Nathan Habib commited on

Fix model eval links and remove huggingface icon from Leaderboard name
439afd4

eduagarcia commited on

Refactor code for adding generic tasks
36e3010

eduagarcia commited on

Update src/leaderboard/filter_models.py
2014251
verified

clefourrier HF staff commited on

Update src/leaderboard/filter_models.py
f6d5857
verified

clefourrier HF staff commited on

update flag for moe
80f473c

Clémentine commited on

Update src/leaderboard/filter_models.py
78e2c07
verified

clefourrier HF staff commited on

Update src/leaderboard/filter_models.py
2227d54
verified

clefourrier HF staff commited on

Removed flag from models with correct metadata
530580c
verified

clefourrier HF staff commited on

Added check to hide non FINISHED models
d9f882d

Clémentine commited on

add new flags
6c60c29

Clémentine commited on

simplified display, added an extra config repo to carry dynamic information
9b2e755

Clémentine commited on

wip
0c7ef71

Clémentine commited on

Update src/leaderboard/read_evals.py
3b554b5

clefourrier HF staff commited on

Incorrectly tagged merges are now flagged
90fa47e

Clémentine commited on

Added checkbox for merges
b762711

Clémentine commited on

flag model
991b9e1

Nathan Habib commited on

flag model
511d367

Nathan Habib commited on

adding merge check - super slow but at least info is displayed
20b060e

Clémentine commited on

flag models
c841f87

Nathan Habib commited on

flag models
425be57

Nathan Habib commited on

flag models
d93b3d2

Nathan Habib commited on

flag models
42f5749

Nathan Habib commited on

flag models
71834c1

Nathan Habib commited on

flag models
c1d0f7f

Nathan Habib commited on

nathan-flagged-models-vis (#478)
460ecf2

clefourrier HF staff commited on

added flag
783ccc5

Clémentine commited on

added tigerbot models to do not submit per authors request
202d26e

Clémentine commited on

flagging tiger models
e629df0

Nathan Habib commited on

simplified some parts of the code + updated requirements
9d22eee

Clémentine commited on

add model architecture as column
3dfaf22

Clémentine commited on

Refactor 2 - added plotting back
b1a1395

Clémentine commited on

Fix requirements for mistral models - to change once transformers gets updated.
002172c

Clémentine commited on

fix col width
fc1e99b

Clémentine commited on