version: 0.0.4
tasks:
  oab_exams:
    benchmark: oab_exams
    col_name: OAB Exams
    task_list:
      - oab_exams_generate
    metric: exact_match
    few_shot: 5
    limit: null
    baseline: 25.0
    human_baseline: 50.0
    description: OAB Exams is a dataset of 2,000 questions from the Brazilian Bar
      Association's exams.
    link: https://huggingface.co/datasets/eduagarcia/oab_exams
  brazilian_court_decisions_judgment:
    benchmark: brazilian_court_decisions_judgment
    col_name: BR Court Decisions
    task_list:
      - brazilian_court_decisions_judgment_generate
    metric: f1_macro
    few_shot: 5
    limit: null
    baseline: 33.33
    human_baseline: 100.0
    description: A classification dataset of court decisions from the Tribunal de
      Justiça de Alagoas (TJAL), the State Supreme Court of Alagoas, Brazil.
    link: https://huggingface.co/datasets/joelniklaus/brazilian_court_decisions
  datalawyer_frases:
    benchmark: datalawyer_frases
    col_name: DL Frases
    task_list:
      - datalawyer_frases_generate
    metric: f1_macro
    few_shot: 15
    limit: 2000
    baseline: 10.0
    human_baseline: 100.0
    description: A classification dataset
    link: https://huggingface.co/datasets/eduagarcia/portuguese_benchmark
  rrip:
    benchmark: rrip
    col_name: RRIP
    task_list:
      - rrip_generate
    metric: f1_macro
    few_shot: 15
    limit: null
    baseline: 12.5
    human_baseline: 100.0
    description: A classification dataset
    link: https://huggingface.co/datasets/eduagarcia/portuguese_benchmark
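
The configuration does not say how the `baseline` field is consumed. One plausible reading is that it is a reference score used to rescale raw results so that the baseline maps to 0 and a perfect score maps to 100. The Python sketch below illustrates that reading only. The `rescale` and `summarize` helpers and the rescaling formula are assumptions, not part of the original tooling; the task names and baseline values are copied from the configuration.

```python
# Hypothetical helpers: an illustration of one way `baseline` could be used.
# Nothing here is taken from the tooling that actually reads this config.

# Baseline values copied from the configuration (percent scale).
BASELINES = {
    "oab_exams": 25.0,
    "brazilian_court_decisions_judgment": 33.33,
    "datalawyer_frases": 10.0,
    "rrip": 12.5,
}


def rescale(score: float, baseline: float, ceiling: float = 100.0) -> float:
    """Map `baseline` to 0 and `ceiling` to 100 (assumed convention)."""
    if ceiling <= baseline:
        raise ValueError("ceiling must be greater than baseline")
    return (score - baseline) / (ceiling - baseline) * 100.0


def summarize(raw_scores: dict) -> dict:
    """Rescale every raw score whose task name appears in BASELINES."""
    return {
        task: round(rescale(score, BASELINES[task]), 2)
        for task, score in raw_scores.items()
        if task in BASELINES
    }
```

Under this assumed convention, a raw `oab_exams` score of 62.5 sits halfway between the 25.0 baseline and 100, so `summarize({"oab_exams": 62.5})` gives `{"oab_exams": 50.0}`. Scores below the baseline come out negative, which a real pipeline might clip to zero.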