webbigdata/ALMA-7B-Ja-V2

Text Generation

text-generation-inference


dahara1 committed on Nov 3, 2023

Commit 67a66cc · 1 Parent(s): 6ad4d23

Update README.md

Files changed (1)

README.md +2 -2


@@ -123,8 +123,8 @@ Using Colab, Google's free web tool, you can easily verify the performance of AL
 GPTQはモデルサイズを小さくする手法(量子化といいます)です。
 GPTQ is a technique (called quantization) that reduces model size.
-ALMA-7B-Ja-V2-GPTQ-Ja-EnはGPTQを量子化したもので、モデルサイズ(3.9GB)とメモリ使用量を削減し、速度を向上しています。
-ALMA-7B-Ja-V2-GPTQ-Ja-En is a quantized version of GPTQ, which reduces model size (3.9 GB) and memory usage and increases speed.
+[ALMA-7B-Ja-V2-GPTQ-Ja-En](https://huggingface.co/webbigdata/ALMA-7B-Ja-V2-GPTQ-Ja-En)はGPTQ量子化版で、モデルサイズ(3.9GB)とメモリ使用量を削減し、速度を向上しています。
+[ALMA-7B-Ja-V2-GPTQ-Ja-En](https://huggingface.co/webbigdata/ALMA-7B-Ja-V2-GPTQ-Ja-En) is a quantized GPTQ version, which reduces model size (3.9 GB) and memory usage and increases speed.
 ただし、性能は少し落ちてしまいます。また、日本語と英語以外の言語への翻訳能力は著しく低下しているはずです。
 However, performance is slightly reduced. Also, the ability to translate into languages other than Japanese and English should be significantly reduced.