open_pt_llm_leaderboard

Running on CPU Upgrade

Add env variable SHOW_INCOMPLETE_EVALS and order evaluation queue by priority

8aaf0e7 9 months ago

454 Bytes

	CHANGELOG_TEXT = f"""
	# Changes made to the leaderboard

	### [1.1.0] - 2024-02-16
	Removed the Sparrow POR benchmark from the leaderboard because of low quality annotations
	Added HateBR Offensive, PT Hate Speech and tweetSentBR benchmarks to the leaderboard, started new evaluation queue for these benchmarks

	### [1.0.0] - 2024-02-01
	Protype version launched with 7 benchmarks ENEM, BLUEX, OAB Exams, ASSIN 2 RTE and STS, FAQUAD NLI and SPARROW POR
	"""