Spaces: CoreyMorris/MMLU-by-task-Leaderboard
4 contributors
History: 128 commits
Latest commit: Corey Morris, "Updated contaminated models" (e3863f2), 11 months ago
File                             Size       Updated        Last commit
.gitattributes                   1.52 kB    12 months ago  initial commit
.gitignore                       63 Bytes   11 months ago  updated gitignore
.gitmodules                      106 Bytes  12 months ago  added hugging face evaluation harness results submodule
README.md                        248 Bytes  12 months ago  initial commit
app.py                           15.7 kB    11 months ago  Added statement of removal of models
contaminated_models.csv          117 Bytes  11 months ago  Updated contaminated models
contaminated_models.txt          65 Bytes   11 months ago  Updated contaminated models
details_data_processor.py        4.04 kB    11 months ago  updated pipeline and init
requirements.txt                 199 Bytes  11 months ago  updated requirements.txt
result_data_processor.py         5.94 kB    11 months ago  removing models that are known to have training data contaminated with evaluations
save_for_regression.py           1.86 kB    11 months ago  changed to save and load in a directory
test_details_data_processing.py  4.33 kB    11 months ago  added a test
test_integration.py              1.96 kB    11 months ago  fixed test_streamlit_app_runs
test_regression.py               1.26 kB    11 months ago  added todo for test
test_result_data_processing.py   1.66 kB    11 months ago  Added organization to dataframe