xixianliao committed
Commit: e136825
Parent(s): 7fee9d1
Upload model

README.md CHANGED
@@ -32,7 +32,7 @@ It is trained on a combination of Catalan-Chinese datasets
totalling 94.187.858 sentence pairs. 113.305 sentence pairs were parallel data collected from the web, while the remaining 94.074.553 sentence pairs
were parallel synthetic data created using the
[Aina Project's Spanish-Catalan machine translation model](https://huggingface.co/projecte-aina/aina-translator-es-ca) and the [Aina Project's English-Catalan machine translation model](https://huggingface.co/projecte-aina/aina-translator-en-ca).
- The model was evaluated on the Flores, NTREX, and Projecte Aina's Catalan-Chinese evaluation datasets.
+ The model was evaluated on the Flores, NTREX, and Projecte Aina's Catalan-Chinese evaluation datasets, achieving results comparable to those of Google Translate.

## Intended uses and limitations

@@ -137,7 +137,7 @@ Weights were saved every 500 updates.
Below are the evaluation results on [Flores-200](https://github.com/facebookresearch/flores/tree/main/flores200),
[NTREX](https://github.com/MicrosoftTranslator/NTREX), and Projecte Aina's Catalan-Chinese test sets, compared to Google Translate for the ZH-CA direction. The evaluation was conducted with [`tower-eval`](https://github.com/deep-spin/tower-eval) following the standard setting (beam search with beam size 5, limiting the translation length to 200 tokens). We report the following metrics:

- - BLEU: Sacrebleu implementation, version:2.4.0.
+ - BLEU: Sacrebleu implementation, version: 2.4.0.
- ChrF: Sacrebleu implementation.
- Comet: Model checkpoint: "Unbabel/wmt22-comet-da".
- Comet-kiwi: Model checkpoint: "Unbabel/wmt22-cometkiwi-da".
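For reference, the snippet below is a minimal sketch of how the metrics listed above could be computed with the `sacrebleu` and `unbabel-comet` Python packages. It assumes you already have system translations (e.g. decoded with beam size 5 and a 200-token limit, as described above); the example sentences, batch size, and device setting are illustrative only and not taken from the model card.

```python
# Sketch: score a set of system translations with BLEU, ChrF, and COMET.
# Requires: pip install sacrebleu unbabel-comet
import sacrebleu
from comet import download_model, load_from_checkpoint

# Illustrative data; in practice these come from the test sets and the model's output.
sources = ["这是一个测试句子。"]
hypotheses = ["Aquesta és una frase de prova."]
references = ["Això és una frase de prova."]

# BLEU and ChrF via the sacrebleu implementation.
bleu = sacrebleu.corpus_bleu(hypotheses, [references])
chrf = sacrebleu.corpus_chrf(hypotheses, [references])
print(f"BLEU: {bleu.score:.2f}  ChrF: {chrf.score:.2f}")

# COMET with the reference-based "Unbabel/wmt22-comet-da" checkpoint.
comet_path = download_model("Unbabel/wmt22-comet-da")
comet_model = load_from_checkpoint(comet_path)
comet_data = [
    {"src": s, "mt": h, "ref": r}
    for s, h, r in zip(sources, hypotheses, references)
]
comet_out = comet_model.predict(comet_data, batch_size=8, gpus=0)
print(f"COMET: {comet_out.system_score:.4f}")

# COMET-Kiwi ("Unbabel/wmt22-cometkiwi-da") is scored the same way but is
# reference-free: each item only needs "src" and "mt". That checkpoint may
# require accepting its license on the Hugging Face Hub before downloading.
```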