Commit History

Updated bibtex citation
bac5383
verified

minseokbae commited on

Update src/display/about.py
e2aca33
verified

ofermend commited on

Update src/display/about.py
3c0cb66
verified

ofermend commited on

Added tags metadata to make the leaderboard more discoverable.
1ad00dd
verified

minseokbae commited on

Update README.md
2a16b2e
verified

ofermend commited on

fixed typo
fa4eaec

Minseok Bae commited on

modified about.py
818ee3d

Minseok Bae commited on

Modified about.py so that it displays (%) in columns.
5bcc476

Minseok Bae commited on

Fixed the leaderboard filtering functionality. Modified filter_models() function in app.py/
1f26f6c

Minseok Bae commited on

modified the evaluation pipelines.
2c24f05

Minseok Bae commited on

Added citations
b46b972

Minseok Bae commited on

Updated about.py
dbcffd4

Minseok Bae commited on

Edited README and added reproducibility functionality in main_backend.py
f0b90cf

Minseok Bae commited on

modified read_evals.py
c3e9147

Minseok Bae commited on

Refine the code style
156ef43

Minseok Bae commited on

Implemented litellm pipeline
2864204

Minseok Bae commited on

Edited README and removed error-rate metric
404587d

Minseok Bae commited on

modified is_model_on_hub()
3b66490

Minseok Bae commited on

changed back to TOKEN
0c85a8e

Minseok Bae commited on

changed to HF_TOKEN
a9a1c18

Minseok Bae commited on

modified check_validity.py and added sample dataset to test functionality
099e4e2

Minseok Bae commited on

Integrated backend pipelines - error occurs during model submission. (Debugging needed).
58b9de9

Minseok Bae commited on

Modified for hallucination evaluation task
d7b7dc6

Minseok Bae commited on

Update README.md
767187a

ofermend commited on

Update src/display/about.py
0baf5c4

ofermend commited on

update read
943f952

Clémentine commited on

fixs
314f91a

Clémentine commited on

fix
1257fc3

Clémentine commited on

updated leaderboard
efeee6d

Clémentine commited on

Simplified leaderboard v0
9833cdb

Clémentine commited on

adding pull back
d084b26

Clémentine commited on

simplified some parts of the code + updated requirements
9d22eee

Clémentine commited on

Added check on tokenizer to prevent submissions which won't run
7302987

Clémentine commited on

Update benchmark count and fix typo (`inetuning->finetuning`) (#395)
7abc6a7

clefourrier HF staff alvarobartt HF staff commited on

Update README.md
96d111a

clefourrier HF staff commited on

make faster thanks to no concurrency limit
d4aa996

Clémentine commited on

fix order of request file vs request file list, to avoid resubmitting issues
976f398

Clémentine commited on

cache
4ff9eef

Clémentine commited on

update for caching
395eff6

Clémentine commited on

simplify launcher + remove dataframe warning on boolean columns
ab6f548

Clémentine commited on

add model architecture as column
3dfaf22

Clémentine commited on

Simplify About
eaace79

Clémentine commited on

Try concurrency management
bb149ba

Clémentine commited on

up sdk
d45f810

Clémentine commited on

fix
be0d7e4

Clémentine commited on

Refactor 2 - added plotting back
b1a1395

Clémentine commited on

fix value error in param size
ccefec9

Clémentine commited on

Fix requirements for mistral models - to change once transformers gets updated.
002172c

Clémentine commited on

Update app.py
a163e5c

clefourrier HF staff commited on

req
c5938bb

Clémentine commited on