Update README.md

Note that as `--act-order` was used, this will not work with ooba's fork of GPTQ. You must use the qwopqwop repo as of April 13th.

Command to clone the latest Triton GPTQ-for-LLaMa repo for inference using `llama_inference.py`, or in `text-generation-webui`:

```
# Clone text-generation-webui, if you don't already have it
git clone https://github.com/oobabooga/text-generation-webui
# Make a repositories directory
mkdir -p text-generation-webui/repositories
cd text-generation-webui/repositories
# Clone the latest GPTQ-for-LLaMa code inside text-generation-webui
git clone https://github.com/qwopqwop200/GPTQ-for-LLaMa
```
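
Once cloned, the quantised model can be tested from the command line with GPTQ-for-LLaMa's `llama_inference.py`. A minimal sketch, assuming the model files have already been downloaded locally; the model directory, the safetensors filename, and the `--wbits`/`--groupsize` values below are placeholders that must match your download:

```
# Hypothetical invocation: adjust the model directory and filename to your download
cd text-generation-webui/repositories/GPTQ-for-LLaMa
CUDA_VISIBLE_DEVICES=0 python llama_inference.py /path/to/model-dir \
  --wbits 4 \
  --groupsize 128 \
  --load /path/to/model-dir/model-4bit-128g.safetensors \
  --text "Tell me about alpacas"
```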

There is also a `no-act-order.safetensors` file which will work with oobabooga's fork of GPTQ-for-LLaMa; it does not require the latest GPTQ code.
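
To load either version in `text-generation-webui` itself, the GPTQ parameters are passed when starting the server. A hedged sketch, assuming the model folder has been placed under `text-generation-webui/models/`; the folder name here is a placeholder:

```
# Hypothetical invocation: the model folder name is a placeholder
cd text-generation-webui
python server.py --model your-model-folder --wbits 4 --groupsize 128 --model_type llama
```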