Spaces:

CoreyMorris
/

MMLU-by-task-Leaderboard

Running

App Files Files Community

MMLU-by-task-Leaderboard

4 contributors

History: 136 commits

Corey Morris

Show a random question from the moral scenarios evaluation

19c7c67 11 months ago

.gitattributes

1.52 kB

initial commit 12 months ago
.gitignore

63 Bytes

updated gitignore 11 months ago
.gitmodules

106 Bytes

added hugging face evaluation harness results submodule 12 months ago
README.md

248 Bytes

initial commit 12 months ago
app.py

16.1 kB

Show a random question from the moral scenarios evaluation 11 months ago
contaminated_models.csv

117 Bytes

Updated contaminated models 11 months ago
contaminated_models.txt

65 Bytes

Updated contaminated models 11 months ago
details_data_processor.py

4.04 kB

updated pipeline and init 11 months ago
dev_requirements.txt

130 Bytes

updated dev requirements 11 months ago
moral_scenarios_questions.csv

370 kB

Show a random question from the moral scenarios evaluation 11 months ago
requirements.txt

199 Bytes

updated requirements.txt 11 months ago
result_data_processor.py

6.19 kB

Returning just a single file per model directory. Manually removing gpt-j-6b for now because there is something that is causing problems with processing the data 11 months ago
save_for_regression.py

1.86 kB

changed to save and load in a directory 11 months ago
test_details_data_processing.py

4.33 kB

added a test 11 months ago
test_integration.py

1.96 kB

fixed test_streamlit_app_runs 11 months ago
test_regression.py

1.26 kB

added todo for test 11 months ago
test_result_data_processing.py

1.66 kB

Added organization to dataframe 11 months ago