OPEA
/

DeepSeek-V3-int4-sym-gptq-inc

4-bit precision

Model card Files Files and versions Community

cicdatopea commited on 22 days ago

Commit

cba3df0

·

verified ·

1 Parent(s): c495ad5

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -6,7 +6,7 @@ base_model:
 ---
 ## Model Details
-This model is an int4 model with group_size 128 and and symmetric quantization of [deepseek-ai/DeepSeek-V3](https://huggingface.co/deepseek-ai/DeepSeek-V3) generated by [intel/auto-round](https://github.com/intel/auto-round) algorithm.
 **Please note that loading the model in Transformers can be quite slow. Consider using an alternative serving framework for better performance.**
@@ -156,7 +156,7 @@ we have no enough resource to evaluate the model
 ### Generate the model
-need 200G GPU memory, details will updated later

 ---
 ## Model Details
+This model is an int4 model with group_size 128 and symmetric quantization of [deepseek-ai/DeepSeek-V3](https://huggingface.co/deepseek-ai/DeepSeek-V3) generated by [intel/auto-round](https://github.com/intel/auto-round) algorithm.
 **Please note that loading the model in Transformers can be quite slow. Consider using an alternative serving framework for better performance.**
 ### Generate the model
+need 200G GPU memory, details will be updated later