Commit History

Update src/leaderboard/read_evals.py
09cd30b
verified

eduagarcia commited on

Make model text exibit precision if there's more than one eval or precision is not float16 or bfloat16
59399bc

eduagarcia commited on

fix eval_name for non main revision models
4717ca8

eduagarcia commited on

fix typo and multiple models in README
b4fc70b

eduagarcia commited on

Permit different revision
a3b0a0f

eduagarcia commited on

Fix num_parameters in some models
0c95be4

eduagarcia commited on

Add new column: Main Language
6da7311

eduagarcia commited on

Add raw results links if exists, and fix minor issues
aa7060a

eduagarcia commited on

Add env variable SHOW_INCOMPLETE_EVALS and order evaluation queue by priority
8aaf0e7

eduagarcia commited on

Allow old model metrics
6269bd0

eduagarcia commited on

Add new tasks and make leadboard work without new tasks evals
5639a81

eduagarcia commited on

support hf leaderboard format and my format
a69553b

eduagarcia commited on

Add hidden option
b234783

eduagarcia commited on

Feature: FIELD with original HF Leaderboard ranking
71ecfbb

eduagarcia commited on

Evaluation time metric and plot
359d8a9

eduagarcia commited on

Fix model eval links and remove huggingface icon from Leaderboard name
439afd4

eduagarcia commited on

Refactor code for adding generic tasks
36e3010

eduagarcia commited on

Added check to hide non FINISHED models
d9f882d

Clémentine commited on

simplified display, added an extra config repo to carry dynamic information
9b2e755

Clémentine commited on

wip
0c7ef71

Clémentine commited on

Update src/leaderboard/read_evals.py
3b554b5

clefourrier HF staff commited on

Incorrectly tagged merges are now flagged
90fa47e

Clémentine commited on

Added checkbox for merges
b762711

Clémentine commited on

adding merge check - super slow but at least info is displayed
20b060e

Clémentine commited on

simplified some parts of the code + updated requirements
9d22eee

Clémentine commited on

add model architecture as column
3dfaf22

Clémentine commited on

Refactor 2 - added plotting back
b1a1395

Clémentine commited on

Fix requirements for mistral models - to change once transformers gets updated.
002172c

Clémentine commited on

fix col width
fc1e99b

Clémentine commited on

refacto style + rate limit
df66f6e

Clémentine commited on

Fix TruthfulQA NaN scores to 0
bb17be3

Clémentine commited on

refacto part 1
2a5f9fb

Clémentine commited on