File size: 999 Bytes
0fd5a99 d56e5d7 0fd5a99 d56e5d7 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 |
---
license: apache-2.0
datasets:
- inseq/scat
- gsarti/iwslt2017_context
language:
- en
- fr
pipeline_tag: translation
tags:
- arxiv:2310.01188
- contextual-mt
- document-mt
---
*This model corresponds to the [`Helsinki-NLP/opus-mt-tc-big-en-fr`](https://huggingface.co/Helsinki-NLP/opus-mt-tc-big-en-fr) further trained on English-to-French translation on the [IWSLT17 dataset](https://huggingface.co/datasets/gsarti/iwslt2017_context) with context tags using the format:
```
Input: SOURCE_CTX <brk> SOURCE_CURR
Output: TARGET_CTX <brk> TARGET_CURR
```
and further fine-tuned on the training split of [SCAT+](https://huggingface.co/datasets/inseq/scat). The model was used in the evaluation of the paper [Quantifying the Plausibility of Context Reliance in Neural Machine Translation](https://openreview.net/forum?id=XTHfNGI3zT) published at ICLR 2024, also available on [Arxiv](https://arxiv.org/abs/2310.01188). It can be used for English to French contextual and non-contextual translation. |