Commit History

Added new hugging face results
3f507e0

Corey Morris commited on

added a test and removed the code to only test a specific file because that code did not work
6ed8672

Corey Morris commited on

updated to run submodule update
25d217c

Corey Morris commited on

Update pytest run to only run specific test files. Other test files are not ready to be run on a different system yet
9345a86

Corey Morris commited on

Merge branch 'main' of https://github.com/c1505/LLM-Dashboard into main
0e575e0

Corey Morris commited on

Added additional results
7863417

Corey Morris commited on

Updated to reflect number of models. Previously, I think there were duplicates
d396c1e

Corey Morris commited on

Create python-app.yml
063ba51
unverified

Corey commited on

Updated dependencies
73da8d6

Corey Morris commited on

Show a random question from the moral scenarios evaluation
19c7c67

Corey Morris commited on

Returning just a single file per model directory. Manually removing gpt-j-6b for now because there is something that is causing problems with processing the data
794b32b

Corey Morris commited on

added new results
324764c

Corey Morris commited on

TEMPORARY. deleted gpt-j-6b from subdirectory until problems are fixed
1fef386

Corey Morris commited on

updated results
aba4fe2

Corey Morris commited on

updated dev requirements
7681250

Corey Morris commited on

added dev requirmenents
885ecf8

Corey Morris commited on

Updated model count
4f20e65

Corey Morris commited on

Updated contaminated models
e3863f2

Corey Morris commited on

Added statement of removal of models
96ffe12

Corey Morris commited on

removed commented code
7fc9618

Corey Morris commited on

updated update data
280db99

Corey Morris commited on

removing models that are known to have training data contaminated with evaluations
a5840fb

Corey Morris commited on

updated with new hugging face results
916604b

Corey Morris commited on

updated pipeline and init
7f2d984

Corey Morris commited on

removed commented code
2f457d8

Corey Morris commited on

added a test
a13887a

Corey Morris commited on

shortened file name
7622af3

Corey Morris commited on

shortened file name
38d88f9

Corey Morris commited on

using URL as file name
25b87bf

Corey Morris commited on

WIP. Updated download file. Can now download all files. Need to integrate that code to loop through all files to download or combine files first into a single dataframe and then save that
0a77c60

Corey Morris commited on

added new test for a file that currently can be downloaded
6251f5a

Corey Morris commited on

Replicating 404 error with a test so I can troubleshoot
9adae3c

Corey Morris commited on

Updated download_file method
b58e1f0

Corey Morris commited on

Build URL from file path is working
cc32c4f

Corey Morris commited on

moved methods to better match flow
f228d38

Corey Morris commited on

removed most commented out code from details processor
74822dd

Corey Morris commited on

Find files is working as expected
30fa96a

Corey Morris commited on

WIP commit. Finding files can be identical as the method in results_data_processor.
c32735e

Corey Morris commited on

added mostly hardcoded generate url method and test
83a34f0

Corey Morris commited on

Added download file method and test
513e813

Corey Morris commited on

Added basic structure of details data processing and testing. For downloading huggingface details dataset files
ee9e25e

Corey Morris commited on

added todo for test
9f7d306

Corey Morris commited on

added a TODO
201a72d

Corey Morris commited on

changed to save and load in a directory
dd61816

Corey Morris commited on

updated gitignore
a89ad93

Corey Morris commited on

Updated regression test
5d87f13

Corey Morris commited on

comparing current code to the saved file from the last commit
ff055eb

Corey Morris commited on

script to save dataframe to a file only if there are no uncommitted files
7a88af3

Corey Morris commited on

Added a first regression test attempt. It currently fails and values are hardcoded
3ec98e7

Corey Morris commited on

fixed test_streamlit_app_runs
5603e9f

Corey Morris commited on