Pengcheng He
commited on
Commit
•
4d6e70c
1
Parent(s):
02a9971
Add mDeBERTa base model
Browse files
README.md
CHANGED
@@ -1,9 +1,9 @@
|
|
1 |
---
|
2 |
-
language:
|
3 |
tags:
|
4 |
- deberta
|
5 |
- deberta-v3
|
6 |
-
|
7 |
thumbnail: https://huggingface.co/front/thumbnails/microsoft.png
|
8 |
license: mit
|
9 |
---
|
@@ -25,10 +25,10 @@ The mDeBERTa V3 base model comes with 12 layers and a hidden size of 768. Its to
|
|
25 |
|
26 |
We present the dev results on XNLI with zero-shot crosslingual transfer setting, i.e. training with english data only, test with other languages.
|
27 |
|
28 |
-
| Model
|
29 |
-
|
30 |
-
| XLM-R-base
|
31 |
-
| mDeBERTa-base
|
32 |
|
33 |
#### Fine-tuning with HF transformers
|
34 |
|
|
|
1 |
---
|
2 |
+
language: multilingual
|
3 |
tags:
|
4 |
- deberta
|
5 |
- deberta-v3
|
6 |
+
- mdeberta
|
7 |
thumbnail: https://huggingface.co/front/thumbnails/microsoft.png
|
8 |
license: mit
|
9 |
---
|
|
|
25 |
|
26 |
We present the dev results on XNLI with zero-shot crosslingual transfer setting, i.e. training with english data only, test with other languages.
|
27 |
|
28 |
+
| Model | en | fr| es | de | el | bg | ru |tr |ar |vi | th | zh | hi | sw | ur | avg |
|
29 |
+
|--------------|----|----|---- |-- |-- |-- | -- |-- |-- |-- | -- | -- | -- | -- | -- | ----|
|
30 |
+
| XLM-R-base |85.8|79.7|80.7 |78.7 |77.5 |79.6 |78.1 |74.2 |73.8 |76.5 |74.6 |76.7| 72.4| 66.5| 68.3|75.6 |
|
31 |
+
| mDeBERTa-base|88.2|82.6|84.4 |82.7 |82.3 |82.4 |80.8 |79.5 |78.5 |78.1 |76.4 |79.5| 75.9| 73.9| 72.4|**79.8**+/-0.2|
|
32 |
|
33 |
#### Fine-tuning with HF transformers
|
34 |
|