Spaces: CoreyMorris/MMLU-by-task-Leaderboard
4 contributors
History: 90 commits
Latest commit: Corey Morris, 02b1702, over 1 year ago
added failing test for new behavior of organization column. Updated test for rows for the newly added rows
File | Scan | Size | Last commit message | Updated
.gitattributes | Safe | 1.52 kB | initial commit | over 1 year ago
.gitignore | Safe | 44 Bytes | Added .gitignore file | over 1 year ago
.gitmodules | Safe | 106 Bytes | added hugging face evaluation harness results submodule | over 1 year ago
README.md | Safe | 248 Bytes | initial commit | over 1 year ago
app.py | Safe | 15.6 kB | Updated the last updated date to 18Aug | over 1 year ago
requirements.txt | Safe | 199 Bytes | updated requirements.txt | over 1 year ago
result_data_processor.py | | 5.27 kB | removed code to print the number of outliers. could add it back later as logging potentially | over 1 year ago
test_result_data_processing.py | | 1.66 kB | added failing test for new behavior of organization column. Updated test for rows for the newly added rows | over 1 year ago