LiveBench / src

Commit History

chore: Update Tasks enum values in about.py
046ddc7

pufanyi committed

Update GOOGLE_SHEET_ID in envs.py
93dabac

pufanyi committed

chore: Remove commented out code for model information in utils.py
d598d7d

pufanyi committed

chore: Remove commented out code for model information in utils.py
65654bf

pufanyi committed

chore: Update page title to "LiveBench"
8336bbd

pufanyi committed

chore: Update about page title to "Live Bench"
24c1f06

pufanyi committed

Revert "Update repository references in envs.py"
ce61fc8

pufanyi committed

Update repository references in envs.py
1d340cf

pufanyi committed

Update src/envs.py
adad63e
verified

clefourrier (HF staff) committed

added leaderboard component to simplify main script
8b28d2b

Clémentine committed

doc
c1b8a96

Clémentine committed

simplified the template
24622c4

Clémentine committed

CPU, TOKEN, env variables (#4)
55cc480
verified

clefourrier (HF staff) and meg (HF staff) committed

Update src/submission/check_validity.py
6eb8bfd

clefourrier (HF staff) committed

made token a requirement
f982b8e

Clémentine committed

test
f0298e1

Clémentine committed

fix
c15e77e

Clémentine committed

removed quantization to simplify
b899767

Clémentine committed

now with a functionning backend
1ffc326

Clémentine committed

update read
943f952

Clémentine committed

fixs
314f91a

Clémentine committed

updated leaderboard
efeee6d

Clémentine committed

Simplified leaderboard v0
9833cdb

Clémentine committed

simplified some parts of the code + updated requirements
9d22eee

Clémentine committed

Added check on tokenizer to prevent submissions which won't run
7302987

Clémentine committed

Update benchmark count and fix typo (`inetuning->finetuning`) (#395)
7abc6a7

clefourrier (HF staff) and alvarobartt (HF staff) committed

fix order of request file vs request file list, to avoid resubmitting issues
976f398

Clémentine committed

cache
4ff9eef

Clémentine committed

update for caching
395eff6

Clémentine committed

add model architecture as column
3dfaf22

Clémentine committed

Simplify About
eaace79

Clémentine committed

Refactor 2 - added plotting back
b1a1395

Clémentine committed

fix value error in param size
ccefec9

Clémentine committed

Fix requirements for mistral models - to change once transformers gets updated.
002172c

Clémentine committed

fix col width
fc1e99b

Clémentine committed

refacto style + rate limit
df66f6e

Clémentine committed

Fix TruthfulQA NaN scores to 0
bb17be3

Clémentine committed

refacto part 1
2a5f9fb

Clémentine committed

add new evals to the leaderboard
e3aaf53

Nathan Habib committed

token for checking gated base models
f3cda22

Clémentine committed

Fix BibTex author ordering (#342)
216309b

clefourrier (HF staff) and lewtun (HF staff) committed

fix disapearing models
280033c

Nathan Habib committed

Merge branch 'main' of https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard
0f4fbd6

Nathan Habib committed

fix model display when fething metadata
624b3c8

Nathan Habib committed

reorg to simplify nav in code base
6e56e0d

Clémentine committed

should update index in collection as it goes
c212cb7

Clémentine committed

Creating functions for plotting results over time (#295)
f2bc0a5

clefourrier (HF staff) and chriscanal committed

update collection path
36bf18d

Clémentine committed

req test
06acefd

Clémentine committed

added automatic update of the best LLM models
e295ac3

Clémentine committed