Rohith04
/

ct2fast_m2m100_418M

Inference Endpoints

Model card Files Files and versions Community

Rohith04 commited on Jan 19

Commit

ae041c3

•

1 Parent(s): f40dc73

Added better readme

Files changed (1) hide show

README.md +36 -0

README.md CHANGED Viewed

@@ -1,3 +1,39 @@
 ---
 license: mit
 ---

 ---
 license: mit
+library_name: adapter-transformers
+tags:
+- Traslation
+- CTranslate2
 ---
+# Quantized M2M100 for Fast Translation with CTranslate2
+This model is a quantized version of the [M2M100 418M model](https://huggingface.co/facebook/m2m100_418M) from Facebook AI, optimized for fast inference using CTranslate2. It supports translation between 100 languages with significantly improved speed compared to the original model.
+## Key Features
+- **Quantization:** The model is quantized to 8-bit integers, reducing model size and accelerating inference.
+- **CTranslate2:** Leverages CTranslate2 for efficient C++-based inference, further boosting speed.
+- **Multi-Language Support:** Translates between 100 languages, covering a wide range of linguistic needs.
+## Useage
+```py
+import ctranslate2
+import transformers
+translator = ctranslate2.Translator("Rohith04/ct2fast_m2m100_418M")
+tokenizer = transformers.AutoTokenizer.from_pretrained("facebook/m2m100_418M")
+tokenizer.src_lang = "en"
+source = tokenizer.convert_ids_to_tokens(tokenizer.encode("Hello world!"))
+target_prefix = [tokenizer.lang_code_to_token["de"]]
+results = translator.translate_batch([source], target_prefix=[target_prefix])
+target = results[0].hypotheses[0][1:]
+print(tokenizer.decode(tokenizer.convert_tokens_to_ids(target)))
+```
+## Resources
+Original model: https://huggingface.co/facebook/m2m100_418M
+CTranslate2: https://github.com/OpenNMT/CTranslate2