eduagarcia's picture
Add env variable SHOW_INCOMPLETE_EVALS and order evaluation queue by priority
8aaf0e7
raw
history blame
454 Bytes
CHANGELOG_TEXT = f"""
# Changes made to the leaderboard
### [1.1.0] - 2024-02-16
Removed the Sparrow POR benchmark from the leaderboard because of low quality annotations
Added HateBR Offensive, PT Hate Speech and tweetSentBR benchmarks to the leaderboard, started new evaluation queue for these benchmarks
### [1.0.0] - 2024-02-01
Protype version launched with 7 benchmarks ENEM, BLUEX, OAB Exams, ASSIN 2 RTE and STS, FAQUAD NLI and SPARROW POR
"""