Commit History

Added basic structure of details data processing and testing, for downloading Hugging Face details dataset files
ee9e25e

Corey Morris committed on

added todo for test
9f7d306

Corey Morris committed on

added a TODO
201a72d

Corey Morris committed on

changed to save and load in a directory
dd61816

Corey Morris committed on

updated gitignore
a89ad93

Corey Morris committed on

Updated regression test
5d87f13

Corey Morris committed on

comparing current code to the saved file from the last commit
ff055eb

Corey Morris committed on

script to save dataframe to a file only if there are no uncommitted files
7a88af3

Corey Morris committed on
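
The "save only if there are no uncommitted files" check described in the commit above could look roughly like the sketch below. It is not the repository's actual script: the function names and the `save` callback are hypothetical, and it assumes "no uncommitted files" means that `git status --porcelain` prints nothing (which also treats untracked files as uncommitted).

```python
import subprocess
from typing import Callable


def is_worktree_clean(porcelain_output: str) -> bool:
    """Return True when `git status --porcelain` printed nothing."""
    return porcelain_output.strip() == ""


def git_porcelain_status(repo_dir: str = ".") -> str:
    """Run `git status --porcelain` in repo_dir and return its stdout."""
    result = subprocess.run(
        ["git", "status", "--porcelain"],
        cwd=repo_dir,
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


def save_if_clean(save: Callable[[], None], repo_dir: str = ".") -> bool:
    """Call save() only when the working tree is clean.

    `save` is a hypothetical callback, e.g. lambda: df.to_csv(path).
    Returns True if the save ran, False if it was skipped.
    """
    if not is_worktree_clean(git_porcelain_status(repo_dir)):
        return False
    save()
    return True
```

Tying the saved file to a clean working tree means the stored dataframe always corresponds to a specific commit, which is what makes a later regression comparison meaningful.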

Added a first regression test attempt. It currently fails and values are hardcoded
3ec98e7

Corey Morris committed on

fixed test_streamlit_app_runs
5603e9f

Corey Morris committed on

Fixed type error
e79bcf3

Corey Morris committed on

WIP commit. Currently have nlargest error
d506f10

Corey Morris committed on

Added test for the specific method that is currently producing an error
5b83d0b

Corey Morris committed on

Added failing integration test. Currently fails because of the addition of the organization to the dataframe
de65005

Corey Morris committed on

Added organization to dataframe
52d3b03

Corey Morris committed on

added failing test for new behavior of organization column. Updated test for rows for the newly added rows
02b1702

Corey Morris committed on

Removed code to print the number of outliers. It could be added back later as logging
cd21f99

Corey Morris committed on

renamed file for clarity
229d6d1

Corey Morris committed on

Added .gitignore file
368802d

Corey Morris committed on

Added new results from Hugging Face
49d555f

Corey Morris committed on

MC1 column had 8 rows with a value of 1. That didn't make sense given that the next highest value was 0.47. Assuming they were data errors, they were removed
e03b231

Corey Morris committed on

truthfulqa data added to dataframe
abac22e

Corey Morris committed on

Refactor to make later code changes easier
6d41115

Corey Morris committed on

Added test for removal of undesired columns. fixed code error in column removal
9549fcc

Corey Morris committed on

Added initial tests. test_columns and test_rows will be updated or removed later, as they test for the exact number of columns and rows. The number of rows will change as more models are added
85667d0

Corey Morris committed on

Updated the last updated date to 18Aug
42ff7b9

Corey Morris committed on

updated with new Hugging Face results
8ccc242

Corey Morris committed on

Updated description with more models
7f24726

Corey Morris committed on

updated results
80c79bd

Corey Morris committed on

fixed error
d7b89ce

Corey Morris committed on

Added google analytics snippet
9444cd2

Corey Morris committed on

Increased size of scatter plot
2b16774

Corey Morris committed on

Made the radar plot larger
f52387e

Corey Morris committed on

Moved radar plots to higher in the page
12a9766

Corey Morris committed on

Modified title and explanation to better reflect what the site is
18ec1ba

Corey Morris committed on

Moved radar chart to after analysis
fb25b1e

Corey Morris committed on

Added a default model to compare
7b77065

Corey Morris committed on

Improved clarity of explanation for Radar charts
a450af5

Corey Morris committed on

Fixed some of the duplicate model issue
618dcce

Corey Morris committed on

Table now displays the columns that have the top differences
dc21a69

Corey Morris committed on

removed charts with hardcoded tasks. removed hardcoding of model for other charts
a125eb8

Corey Morris committed on

Finding top differences between tasks from the target model
627e0f9

Corey Morris committed on
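
One way to read "top differences between tasks from the target model" is to rank task columns by how far the target model's score is from another model's score. The sketch below illustrates that reading with plain dictionaries instead of a dataframe. The function name, the task names, and the choice of absolute difference as the ranking key are all assumptions, not taken from the repository.

```python
def top_differences(target_scores, other_scores, n=3):
    """Return the n task names with the largest absolute score gap.

    target_scores / other_scores: dicts mapping task name -> score
    (hypothetical stand-ins for two rows of the results dataframe).
    Only tasks present in both dicts are compared.
    """
    shared_tasks = target_scores.keys() & other_scores.keys()
    diffs = {task: target_scores[task] - other_scores[task] for task in shared_tasks}
    # Largest gap first, regardless of which model is ahead.
    return sorted(diffs, key=lambda task: abs(diffs[task]), reverse=True)[:n]


# Hypothetical scores for illustration only.
target = {"task_a": 0.6, "task_b": 0.3, "task_c": 0.5, "task_d": 0.9}
other = {"task_a": 0.5, "task_b": 0.7, "task_c": 0.5, "task_d": 0.6}
largest_gaps = top_differences(target, other, n=2)
```

Ranking by absolute difference surfaces tasks where the target is much better as well as tasks where it is much worse, so a table built on it stays informative in both directions.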

Added explanation for the plot and a dataframe of the models
2db58a0

Corey Morris committed on

Added radar chart. Compares a model to the 5 models that have the closest performance on MMLU_average
9695a47

Corey Morris committed on
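
Selecting "the 5 models that have the closest performance on MMLU_average" can be sketched as a nearest-neighbour lookup on a single score. The example below uses a plain dictionary rather than the app's dataframe. The function name, model names, and scores are hypothetical, and absolute difference is assumed as the closeness measure.

```python
def closest_models(mmlu_average, target, k=5):
    """Return the k model names whose score is nearest to the target's.

    mmlu_average: dict mapping model name -> MMLU_average score
    (a hypothetical stand-in for one dataframe column).
    The target model itself is excluded from the result.
    """
    target_score = mmlu_average[target]
    distances = [
        (abs(score - target_score), name)
        for name, score in mmlu_average.items()
        if name != target
    ]
    # Sort by distance; ties fall back to alphabetical model name.
    return [name for _, name in sorted(distances)[:k]]


# Hypothetical scores for illustration only.
scores = {
    "model_a": 0.50, "model_b": 0.52, "model_c": 0.40, "model_d": 0.49,
    "model_e": 0.70, "model_f": 0.55, "model_g": 0.46,
}
neighbours = closest_models(scores, "model_a", k=5)
```

The returned names could then be passed, together with the target, to whatever plotting code draws the radar chart.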

added new results from Hugging Face
b9b6115

Corey Morris committed on

Added header back for the table
2a7f691

Corey Morris committed on

Added citation for the site
ea8703d

Corey Morris committed on

Changed streamlit to wide layout to see more of the table
1e6b767

Corey Morris committed on

Updated updated date
28d4d6a

Corey Morris committed on

Added filter for parameter count. Fixed model filter so that it only filters on the Model name (index of the table)
8474e43

Corey Morris committed on