Update README.md
README.md CHANGED
@@ -31,15 +31,21 @@ I have the following Koala model repositories available:
 * [GPTQ quantized 4bit 7B model in `pt` and `safetensors` formats](https://huggingface.co/TheBloke/koala-7B-GPTQ-4bit-128g)
 * [GPTQ quantized 4bit 7B model in GGML format for `llama.cpp`](https://huggingface.co/TheBloke/koala-7B-GPTQ-4bit-128g-GGML)
 
+## GETTING GIBBERISH OUTPUT?
+
+Please read the sections below carefully. Gibberish output is expected if you are using the `safetensors` file without the latest GPTQ-for-LLaMa code.
+
+Your options are either to update GPTQ-for-LLaMa under `text-generation-webui/repositories` to a more recent version, or to use the other file provided, `koala-7B-4bit-128g.no-act-order.ooba.pt`, which will work immediately.
+
+Unfortunately, right now it is a bit more complex to update GPTQ-for-LLaMa, because the most recent code has breaking changes which are not supported by `text-generation-webui`.
+
+Therefore it's currently recommended to use `koala-7B-4bit-128g.no-act-order.ooba.pt`.
+
 ## Provided files
 
 Three model files are provided. You don't need all three - choose the one that suits your needs best!
 
 Details of the files provided:
-* `koala-7B-4bit-128g.pt`
-  * `pt` format file, created with the latest [GPTQ-for-LLaMa](https://github.com/qwopqwop200/GPTQ-for-LLaMa) code.
-  * Command to create:
-    * `python3 llama.py koala-7B-HF c4 --wbits 4 --true-sequential --act-order --groupsize 128 --save koala-7B-4bit-128g.pt`
 * `koala-7B-4bit-128g.safetensors`
   * newer `safetensors` format, with improved file security, created with the latest [GPTQ-for-LLaMa](https://github.com/qwopqwop200/GPTQ-for-LLaMa) code.
   * Command to create:
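The "update GPTQ-for-LLaMa under `text-generation-webui/repositories`" option that the changed README recommends against (for now) could be sketched as a small shell script. The directory path below is an assumption based on a default `text-generation-webui` checkout, not something stated in this diff, so adjust it to your install:

```shell
#!/bin/sh
# Hedged sketch: refresh the GPTQ-for-LLaMa checkout that text-generation-webui
# loads from its repositories/ directory. The path is an assumed default layout.
REPO_DIR="text-generation-webui/repositories/GPTQ-for-LLaMa"

if [ -d "$REPO_DIR" ]; then
  # Pull the latest code. Note the README's warning: the newest commits have
  # breaking changes that text-generation-webui may not support yet.
  git -C "$REPO_DIR" pull
  STATUS="updated"
else
  # Nothing to update here; report the missing checkout instead of failing.
  STATUS="missing: $REPO_DIR"
fi
echo "$STATUS"
```

If pulling the latest code breaks loading, the README's fallback of using `koala-7B-4bit-128g.no-act-order.ooba.pt` avoids touching the checkout at all.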
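The "improved file security" note on the `safetensors` entry refers to the fact that `.pt` checkpoints are Python pickles, which can execute arbitrary code at load time. A minimal stdlib-only sketch of that hazard (the `Malicious` class is invented for illustration; no torch or safetensors installation is needed):

```python
import pickle

# A .pt checkpoint is built on Python's pickle protocol. Unpickling calls
# __reduce__, so a tampered checkpoint can run code during load.
class Malicious:
    def __reduce__(self):
        # On unpickle, this evaluates an expression instead of rebuilding the
        # object -- here it merely reads the interpreter version, but it could
        # run anything.
        return (eval, ("__import__('sys').version",))

payload = pickle.dumps(Malicious())
result = pickle.loads(payload)  # arbitrary code executes here
print(type(result).__name__)  # → str
```

`safetensors`, by contrast, stores a JSON header plus raw tensor bytes, so loading one cannot trigger code execution - which is the "improved file security" the README mentions.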