jhu-clsp
/

kreyol-mt-pubtrain

Text2Text Generation

Inference Endpoints

Model card Files Files and versions Community

n8rob commited on May 31

Commit

8509526

•

1 Parent(s): 0ffe8b4

Update README.md

Files changed (1) hide show

README.md +5 -5

README.md CHANGED Viewed

@@ -74,7 +74,7 @@ And cite our work:
 ## Model hosted here
-This is a many-to-many model for Creole-English, English-Creole and Creole-Creole MT, fine-tuned on top of `facebook/mbart-large-50-many-to-many-mmt`, with only public data.
 Usage:
@@ -82,13 +82,13 @@ Usage:
 from transformers import MBartForConditionalGeneration, AutoModelForSeq2SeqLM
 from transformers import MbartTokenizer, AutoTokenizer
-tokenizer = AutoTokenizer.from_pretrained("n8rob/kreyol-mt-pubtrain", do_lower_case=False, use_fast=False, keep_accents=True)
-# Or use tokenizer = MbartTokenizer.from_pretrained("n8rob/kreyol-mt-pubtrain", use_fast=False)
-model = AutoModelForSeq2SeqLM.from_pretrained("n8rob/kreyol-mt-pubtrain")
-# Or use model = MBartForConditionalGeneration.from_pretrained("n8rob/kreyol-mt-pubtrain")
 # First tokenize the input and outputs. The format below is how the model was trained so the input should be "Sentence </s> SRCCODE". Similarly, the output should be "TGTCODE Sentence </s>".
 # Example: For Saint Lucian Patois to English translation, we need to use language indicator tags: <2acf> and <2eng> where acf represents Saint Lucian Patois and eng represents English.

 ## Model hosted here
+This is a many-to-many model for translation into and out of Creole languages, fine-tuned on top of `facebook/mbart-large-50-many-to-many-mmt`, with only public data.
 Usage:
 from transformers import MBartForConditionalGeneration, AutoModelForSeq2SeqLM
 from transformers import MbartTokenizer, AutoTokenizer
+tokenizer = AutoTokenizer.from_pretrained("jhu-clsp/kreyol-mt-pubtrain", do_lower_case=False, use_fast=False, keep_accents=True)
+# Or use tokenizer = MbartTokenizer.from_pretrained("jhu-clsp/kreyol-mt-pubtrain", use_fast=False)
+model = AutoModelForSeq2SeqLM.from_pretrained("jhu-clsp/kreyol-mt-pubtrain")
+# Or use model = MBartForConditionalGeneration.from_pretrained("jhu-clsp/kreyol-mt-pubtrain")
 # First tokenize the input and outputs. The format below is how the model was trained so the input should be "Sentence </s> SRCCODE". Similarly, the output should be "TGTCODE Sentence </s>".
 # Example: For Saint Lucian Patois to English translation, we need to use language indicator tags: <2acf> and <2eng> where acf represents Saint Lucian Patois and eng represents English.