Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Spaces:
CoreyMorris
/
MMLU-by-task-Leaderboard
like
13
Running
App
Files
Files
Community
4
19c7c67
MMLU-by-task-Leaderboard
4 contributors
History:
136 commits
Corey Morris
Show a random question from the moral scenarios evaluation
19c7c67
11 months ago
.gitattributes
1.52 kB
initial commit
12 months ago
.gitignore
63 Bytes
updated gitignore
11 months ago
.gitmodules
106 Bytes
added hugging face evaluation harness results submodule
12 months ago
README.md
248 Bytes
initial commit
12 months ago
app.py
16.1 kB
Show a random question from the moral scenarios evaluation
11 months ago
contaminated_models.csv
117 Bytes
Updated contaminated models
11 months ago
contaminated_models.txt
65 Bytes
Updated contaminated models
11 months ago
details_data_processor.py
4.04 kB
updated pipeline and init
11 months ago
dev_requirements.txt
130 Bytes
updated dev requirements
11 months ago
moral_scenarios_questions.csv
370 kB
Show a random question from the moral scenarios evaluation
11 months ago
requirements.txt
199 Bytes
updated requirements.txt
11 months ago
result_data_processor.py
6.19 kB
Returning just a single file per model directory. Manually removing gpt-j-6b for now because there is something that is causing problems with processing the data
11 months ago
save_for_regression.py
1.86 kB
changed to save and load in a directory
11 months ago
test_details_data_processing.py
4.33 kB
added a test
11 months ago
test_integration.py
1.96 kB
fixed test_streamlit_app_runs
11 months ago
test_regression.py
1.26 kB
added todo for test
11 months ago
test_result_data_processing.py
1.66 kB
Added organization to dataframe
11 months ago