Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Spaces:
SUSTech
/
tlem
like
5
Running
App
Files
Files
Community
5
refs/pr/4
tlem
Commit History
fix dataset in task
76eab85
facat
commited on
Nov 27, 2023
add math
d13c0d8
facat
commited on
Nov 27, 2023
FIX: extraction func of C-Eval; logging metrics (
#3
)
25e4875
facat
Cookize
commited on
Nov 25, 2023
Add new benchmark (
#2
)
141ccb9
facat
Cookize
commited on
Nov 25, 2023
update mt_bench
845a45a
facat
commited on
Nov 24, 2023
update
33a6f85
facat
commited on
Nov 12, 2023
fix mmlu
9199665
facat
commited on
Nov 12, 2023
fix fewshot
075ef98
facat
commited on
Nov 12, 2023
verbose mode
a034e31
facat
commited on
Nov 12, 2023
add gsm8k
18cd4ae
facat
commited on
Nov 12, 2023
add mmlu and cmmlu
be1543a
facat
commited on
Oct 29, 2023
upd
044ed98
facat
commited on
Sep 6, 2023
update
69b800b
facat
commited on
Sep 6, 2023
refactor
4c7982b
facat
commited on
Sep 6, 2023
fix name
c250b54
facat
commited on
Sep 6, 2023
add suite
a6d7b1c
facat
commited on
Sep 6, 2023
fix
e01a5f6
facat
commited on
Sep 6, 2023
upd
8af54b8
facat
commited on
Sep 6, 2023
initial commit
507319c
facat
commited on
Aug 29, 2023