Update README.md
Browse files
README.md
CHANGED
@@ -49,7 +49,7 @@ classifier(sequence_to_classify, candidate_labels, hypothesis_template=hypothesi
|
|
49 |
|
50 |
|
51 |
## Training
|
52 |
-
This model was pre-trained on a set of 100 languages and follwed further training on 198M multilingual tweets as described in the original [paper](https://arxiv.org/abs/2104.12250). Further it was trained on the training set of XNLI dataset in german which is a machine translated version of the MNLI dataset. It was trained on 5 epochs of the XNLI train set and evaluated on the XNLI eval dataset at the end of every to find the best performing model. The model which had the highest accuracy on the eval set was chosen at the end.
|
53 |
|
54 |
![Training Charts from wandb](screen_wandb.png)
|
55 |
- learning rate: 2e-5
|
|
|
49 |
|
50 |
|
51 |
## Training
|
52 |
+
This model was pre-trained on a set of 100 languages and follwed further training on 198M multilingual tweets as described in the original [paper](https://arxiv.org/abs/2104.12250). Further it was trained on the training set of XNLI dataset in german which is a machine translated version of the MNLI dataset. It was trained on 5 epochs of the XNLI train set and evaluated on the XNLI eval dataset at the end of every epoch to find the best performing model. The model which had the highest accuracy on the eval set was chosen at the end.
|
53 |
|
54 |
![Training Charts from wandb](screen_wandb.png)
|
55 |
- learning rate: 2e-5
|