Commit 098f75b (1 parent: f0d3318), committed by jarodrigues
Update README.md

README.md CHANGED
@@ -129,9 +129,9 @@ For testing, we reserved the translated datasets MRPC (similarity) and RTE (infe
 
 | Model | MRPC (F1) | RTE (F1) | COPA (F1) |
 |--------------------------|----------------|----------------|-----------|
-| **Gervásio 7B PTBR**
-| **LLaMA-2**
-| **LLaMA-2 Chat**
+| **Gervásio 7B PTBR** | **0.7822** | **0.8321** | 0.2134 |
+| **LLaMA-2 (EN)** | 0.0369 | 0.0516 | 0.4867 |
+| **LLaMA-2 Chat (EN)** | 0.5432 | 0.3807 | **0.5493**|
 <br>
 
 For further testing our decoder, in addition to the testing data described above, we also reused some of the datasets that had been resorted for American Portuguese to test the state-of-the-art Sabiá model and that were originally developed with materials from Portuguese: ASSIN2 RTE (entailment) and ASSIN2 STS (similarity), BLUEX (question answering), ENEM 2022 (question answering) and FaQuAD (extractive question-answering).