Commit History
updated dashboard with new data
443052d
check for URL and full model name
2037152
Corey Morris
commited on
Added clickable links (#1)
59c6dd2
unverified
Corey
commited on
updated description
cb2c32e
Corey Morris
commited on
Added new results
f839734
Corey Morris
commited on
removed example table with link
5d3a9b2
Corey Morris
commited on
changed default scatter plot index
25bce6d
Corey Morris
commited on
Loading new csv with updated data
a0c39f5
Corey Morris
commited on
Moved moral scenarios information higher on page
41d7691
Corey Morris
commited on
changed the wording of moral scenarios
e7c50af
Corey Morris
commited on
stripping the whitespace from the input so that the filtering works with or without whitespace
e1345be
Corey Morris
commited on
Support for multi column filtering using comma seperated values (#2)
246a992
Updated
b601bef
Corey Morris
commited on
Updated description and data
383dc16
Corey Morris
commited on
Completed loaded form csv
8ef77e5
Corey Morris
commited on
loading from csv instead of processing data each time
28e8799
Corey Morris
commited on
WIP. Loading data from csv
1a1910c
Corey Morris
commited on
updated date and model count
0c07f8b
Corey Morris
commited on
Added new hugging face results
3f507e0
Corey Morris
commited on
Updated to reflect number of models. Previously, I think there were duplicates
d396c1e
Corey Morris
commited on
Show a random question from the moral scenarios evaluation
19c7c67
Corey Morris
commited on
Updated model count
4f20e65
Corey Morris
commited on
Added statement of removal of models
96ffe12
Corey Morris
commited on
removed commented code
7fc9618
Corey Morris
commited on
updated update data
280db99
Corey Morris
commited on
Fixed type error
e79bcf3
Corey Morris
commited on
WIP commit. Currently have nlargest error
d506f10
Corey Morris
commited on
Updated the last updated date to 18Aug
42ff7b9
Corey Morris
commited on
Updated description with more models
7f24726
Corey Morris
commited on
fixed error
d7b89ce
Corey Morris
commited on
Added google analytics snippet
9444cd2
Corey Morris
commited on
Increased size of scatter plot
2b16774
Corey Morris
commited on
Made the radar plot larger
f52387e
Corey Morris
commited on
Moved radar plots to higher in the page
12a9766
Corey Morris
commited on
Modified title and explanation to better reflect what the site is
18ec1ba
Corey Morris
commited on
Moved radar chart to after analysis
fb25b1e
Corey Morris
commited on
Added a default model to compare
7b77065
Corey Morris
commited on
Improved clarity of explanation for Radar charts
a450af5
Corey Morris
commited on
Fixed some of the diplicate model issue
618dcce
Corey Morris
commited on
Table now displays the columns that have the top differences
dc21a69
Corey Morris
commited on
removed charts with hardcoded tasks. removed hardcoding of model for other charts
a125eb8
Corey Morris
commited on
Finding top differences between tasks from the target model
627e0f9
Corey Morris
commited on
Added explanation for the plot and a dataframe of the models
2db58a0
Corey Morris
commited on
Added radar chart. Compares a model to the 5 models that have the closest performance on MMLU_average
9695a47
Corey Morris
commited on
Added header back for the table
2a7f691
Corey Morris
commited on
Added citation for the site
ea8703d
Corey Morris
commited on
Changed streamlit to wide layout to see more of the table
1e6b767
Corey Morris
commited on
Updated updated date
28d4d6a
Corey Morris
commited on
Added filter for parameter count. Fixed model filter so that it only filters on the Model name (index of the table)
8474e43
Corey Morris
commited on