dahara1 committed
Commit 7607a32
1 Parent(s): ffa99bc

Update README.md

Files changed (1): README.md (+14, -10)
README.md CHANGED
@@ -29,6 +29,8 @@ In addition to translation between Japanese and English, the model also has the
 Meta社の200言語以上の翻訳に対応した超多言語対応機械翻訳モデルNLLB-200シリーズと比較したベンチマーク結果は以下です。
 Benchmark results compared to Meta's NLLB-200 series of massively multilingual machine translation models, which support translation across more than 200 languages, are shown below.

+
+## NLLB-200
 | Model Name | file size |E->J chrf++/F2|E->J comet|J->E chrf++/F2|J->E comet |
 |------------------------------|-----------|--------------|----------|--------------|-----------|
 | NLLB-200-Distilled | 2.46GB | 23.6/- | - | 50.2/- | - |
@@ -37,22 +39,26 @@ Benchmark results compared to Meta's NLLB-200 series of super multilingual machi
 | NLLB-200 | 17.58GB | 25.2/- | - | 55.1/- | - |
 | NLLB-200 | 220.18GB | 27.9/33.2 | 0.8908 | 55.8/59.8 | 0.8792 |

-previous our model(ALMA-7B-Ja)
+## Our previous model (ALMA-7B-Ja)
 | Model Name | file size |E->J chrf++/F2|E->J comet|J->E chrf++/F2|J->E comet |
+|------------------------------|-----------|--------------|----------|--------------|-----------|
 | webbigdata-ALMA-7B-Ja-q4_K_S | 3.6GB | -/24.2 | 0.8210 | -/54.2 | 0.8559 |
 | ALMA-7B-Ja-GPTQ-Ja-En | 3.9GB | -/30.8 | 0.8743 | -/60.9 | 0.8743 |
 | ALMA-Ja(Ours) | 13.48GB | -/31.8 | 0.8811 | -/61.6 | 0.8773 |

-ALMA-7B-Ja-V2
+## ALMA-7B-Ja-V2
+| Model Name | file size |E->J chrf++/F2|E->J comet|J->E chrf++/F2|J->E comet |
+|------------------------------|-----------|--------------|----------|--------------|-----------|
 | ALMA-7B-Ja-V2-GPTQ-Ja-En | 3.9GB | -/33.0 | 0.8818 | -/62.0 | 0.8774 |
 | ALMA-Ja-V2(Ours) | 13.48GB | -/33.9 | 0.8820 | -/63.1 | 0.8873 |
 | ALMA-Ja-V2-Lora(Ours) | 13.48GB | -/33.7 | 0.8843 | -/61.1 | 0.8775 |


+
 様々なジャンルの文章を実際のアプリケーションと比較した結果は以下です。
 Below are the results of comparing translations of various genres of text against actual production applications.

-政府の公式文章 Government Official Announcements
+## 政府の公式文章 Government Official Announcements
 | |e->j chrF2++|e->j BLEU|e->j comet|j->e chrF2++|j->e BLEU|j->e comet|
 |--------------------------|------------|---------|----------|------------|---------|----------|
 | ALMA-7B-Ja-V2-GPTQ-Ja-En | 25.3 | 15.00 | 0.8848 | 60.3 | 26.82 | 0.6189 |
@@ -63,7 +69,7 @@ Here are the results of a comparison of various genres of writing with the actua
 | google-translate | 43.5 | 35.37 | 0.9181 | 62.7 | 29.22 | 0.6446 |
 | deepl | 43.5 | 35.74 | 0.9301 | 60.1 | 27.40 | 0.6389 |

-二次創作 Fanfiction
+## 二次創作 Fanfiction
 | |e->j chrF2++|e->j BLEU|e->j comet|j->e chrF2++|j->e BLEU|j->e comet|
 |--------------------------|------------|---------|----------|------------|---------|----------|
 | ALMA-7B-Ja-V2-GPTQ-Ja-En | 27.6 | 18.28 | 0.8643 | 52.1 | 24.58 | 0.6106 |
@@ -75,21 +81,20 @@ Here are the results of a comparison of various genres of writing with the actua
 | deepl | 33.5 | 28.38 | 0.9094 | 60.0 | 31.14 | 0.6124 |


-[Sample Code For Free Colab](https://github.com/webbigdata-jp/python_sample/blob/main/ALMA_7B_Ja_Free_Colab_sample.ipynb)
+[Sample Code For Free Colab](https://github.com/webbigdata-jp/python_sample/blob/main/ALMA_7B_Ja_V2_Free_Colab_sample.ipynb)



 ## Other Versions

-### ALMA-7B-Ja-V2^GPTQ-Ja-En
+### ALMA-7B-Ja-V2-GPTQ-Ja-En
 GPTQ is a quantization method that reduces model size, and ALMA-7B-Ja-V2-GPTQ-Ja-En is a GPTQ-quantized version that shrinks the model to 3.9GB and lowers memory usage.
 However, its performance is likely somewhat lower, and its translation ability for languages other than Japanese and English has deteriorated significantly.

-[Sample Code For Free Colab webbigdata/ALMA-7B-Ja-V2-GPTQ-Ja-En](https://huggingface.co/webbigdata/ALMA-7B-Ja-V2-GPTQ-Ja-En)
+[Sample Code For Free Colab webbigdata/ALMA-7B-Ja-V2-GPTQ-Ja-En](https://github.com/webbigdata-jp/ALMA/blob/master/ALMA_7B_Ja_V2_GPTQ_Ja_En_Free_Colab_sample.ipynb)

 If you want to translate an entire file at once, try the Colab below.
-[ALMA_7B_Ja_GPTQ_Ja_En_batch_translation_sample](https://github.com/webbigdata-jp/python_sample/blob/main/ALMA_7B_Ja_GPTQ_Ja_En_batch_translation_sample.ipynb)
+[ALMA_7B_Ja_GPTQ_Ja_En_batch_translation_sample](https://github.com/webbigdata-jp/ALMA/blob/master/ALMA_7B_Ja_V2_GPTQ_Ja_En_batch_translation_sample.ipynb)
-


 **ALMA** (**A**dvanced **L**anguage **M**odel-based tr**A**nslator) is an LLM-based translation model that adopts a new translation-model paradigm: it begins with fine-tuning on monolingual data and is further optimized using high-quality parallel data. This two-step fine-tuning process ensures strong translation performance.
@@ -110,6 +115,5 @@ Original Model [ALMA-7B](https://huggingface.co/haoranxu/ALMA-7B). (26.95GB)
 Previous Model [ALMA-7B-Ja](https://huggingface.co/webbigdata/ALMA-7B-Ja). (13.3 GB)


-
 ## About this work
 - **This work was done by:** [webbigdata](https://webbigdata.jp/).
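The tables in this diff report chrF-family, BLEU, and COMET scores. For readers reproducing such numbers, here is a minimal scoring sketch using the sacrebleu library; the commit does not record the exact evaluation settings behind the tables, so the options shown (chrF++ via `word_order=2`, default BLEU tokenization) are assumptions.

```python
# Minimal metric sketch with sacrebleu (pip install sacrebleu).
# The options below are assumptions; the commit does not record the
# exact settings used for the tables above.
import sacrebleu

hypotheses = ["It will probably rain tomorrow."]  # system outputs
references = [["It will likely rain tomorrow."]]  # one reference stream

bleu = sacrebleu.corpus_bleu(hypotheses, references)
# word_order=2 adds word n-grams (chrF++); beta defaults to 2,
# which is the "2" in the chrF2++ column headers.
chrf = sacrebleu.corpus_chrf(hypotheses, references, word_order=2)

print(f"BLEU:   {bleu.score:.1f}")
print(f"chrF++: {chrf.score:.1f}")
# The comet columns come from a separate neural metric, available
# through the Unbabel "comet" package, which downloads a trained model.
```

Note that scoring Japanese output with BLEU normally requires a Japanese-aware tokenizer (sacrebleu supports `ja-mecab`), which is one reason chrF-family scores are often preferred for the e->j direction.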
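The linked notebooks cover setup on Colab. As a quick orientation, below is a minimal sketch of single-sentence translation with Hugging Face transformers; the prompt template follows the upstream ALMA examples, and the generation settings are illustrative assumptions rather than the notebooks' exact configuration.

```python
# Minimal translation sketch (assumes transformers, accelerate, a GPU).
# The prompt template follows the upstream ALMA examples; the generation
# settings are illustrative, not the linked notebook's configuration.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "webbigdata/ALMA-7B-Ja-V2"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

prompt = "Translate this from Japanese to English:\nJapanese: 明日は雨が降るでしょう。\nEnglish:"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
with torch.no_grad():
    output = model.generate(**inputs, max_new_tokens=128, do_sample=False)
# Decode only the newly generated tokens, not the prompt.
print(tokenizer.decode(output[0][inputs.input_ids.shape[1]:], skip_special_tokens=True))
```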
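For whole-file translation, the batch notebook linked in the diff is the reference. A rough line-by-line equivalent, reusing `model` and `tokenizer` from the sketch above, might look like the following; the file names are placeholders, and the notebook may batch and chunk input differently.

```python
# Rough line-by-line file translation, reusing `model` and `tokenizer`
# from the previous sketch. File names are placeholders; the linked
# notebook may batch and chunk input differently.
def translate(text: str, src: str = "Japanese", tgt: str = "English") -> str:
    prompt = f"Translate this from {src} to {tgt}:\n{src}: {text}\n{tgt}:"
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    with torch.no_grad():
        output = model.generate(**inputs, max_new_tokens=256, do_sample=False)
    return tokenizer.decode(
        output[0][inputs.input_ids.shape[1]:], skip_special_tokens=True
    ).strip()

with open("input_ja.txt", encoding="utf-8") as fin, \
     open("output_en.txt", "w", encoding="utf-8") as fout:
    for line in fin:
        line = line.strip()
        fout.write((translate(line) if line else "") + "\n")
```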