Update README.md

README.md

@@ -12,6 +12,7 @@ The size is smaller and the execution speed is faster, but the inference perform
 
 ### sample code
 At least one GPU is currently required due to a limitation of the Accelerate library.
+So this model cannot be run on the Hugging Face Spaces free tier.
 
 ```
 pip install auto-gptq
@@ -40,13 +41,16 @@ output = model.generate(input_ids=tokens, max_new_tokens=100, do_sample=True, te
 print(tokenizer.decode(output[0]))
 ```
 
-###
+### Other documents
 https://github.com/PanQiWei/AutoGPTQ/blob/main/docs/tutorial/01-Quick-Start.md
 
+### Original Authors
+Takeshi Kojima
 
 ### Benchmark
 
-The results below are preliminary. The blank part is under measurement.
+The results below are preliminary; blank entries are still being measured.
+The scores may also change as a result of further tuning.
 
 * **Japanese benchmark**
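The sample-code fragments visible in the diff context (`pip install auto-gptq`, `model.generate(...)`, `tokenizer.decode(output[0])`) can be pieced together into a minimal inference sketch along the lines of the AutoGPTQ quick-start linked above. This is only a sketch: the `repo_id` placeholder, the `temperature` value, and the helper name `generate_sample` are assumptions, not part of the original README.

```python
def generate_sample(prompt: str, repo_id: str = "your-gptq-model-repo") -> str:
    """Generate text from a GPTQ-quantized causal LM (sketch, not the README's exact code)."""
    # Imports are deferred so this sketch can be read (and the function defined)
    # without auto-gptq or a GPU present.
    from transformers import AutoTokenizer
    from auto_gptq import AutoGPTQForCausalLM

    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    # from_quantized loads the GPTQ weights; a CUDA device is required,
    # matching the README's note about the Accelerate limitation.
    model = AutoGPTQForCausalLM.from_quantized(repo_id, device="cuda:0")

    tokens = tokenizer(prompt, return_tensors="pt").input_ids.to("cuda:0")
    # Sampling parameters mirror the truncated generate() call shown in the diff;
    # temperature=0.7 is an assumed value, since the original line is cut off.
    output = model.generate(
        input_ids=tokens,
        max_new_tokens=100,
        do_sample=True,
        temperature=0.7,
    )
    return tokenizer.decode(output[0])
```

Since the diff only shows the first and last lines of the README's code block, the middle steps here are inferred from the linked quick-start tutorial rather than copied from the README itself.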