OPEA
/

DeepSeek-V3-int4-sym-gptq-inc

4-bit precision

Model card Files Files and versions Community

cicdatopea commited on 22 days ago

Commit

c004b1b

·

verified ·

1 Parent(s): df9474a

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -10,7 +10,7 @@ This model is an int4 model with group_size 128 and and symmetric quantization o
 **Please note that loading the model in Transformers can be quite slow. Consider using an alternative serving framework for better performance.**
-Due to limited GPU resources, we have only tested a few prompts on a CPU backend using QBits. If you found this modelnot perform well, **you can explore a quantized model in AWQ format with different hyperparameters generated via AutoRound** which will be uploaded soon
 Please follow the license of the original model.

 **Please note that loading the model in Transformers can be quite slow. Consider using an alternative serving framework for better performance.**
+Due to limited GPU resources, we have only tested a few prompts on a CPU backend with QBits. If this model does not meet your performance expectations, you may explore another quantized model in AWQ format, generated via AutoRound with different hyperparameters. This alternative model will be uploaded soon.
 Please follow the license of the original model.