cicdatopea
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -6,7 +6,7 @@ base_model:
|
|
6 |
---
|
7 |
## Model Details
|
8 |
|
9 |
-
This model is an int4 model with group_size 128 and
|
10 |
|
11 |
**Please note that loading the model in Transformers can be quite slow. Consider using an alternative serving framework for better performance.**
|
12 |
|
@@ -156,7 +156,7 @@ we have no enough resource to evaluate the model
|
|
156 |
|
157 |
### Generate the model
|
158 |
|
159 |
-
need 200G GPU memory, details will updated later
|
160 |
|
161 |
|
162 |
|
|
|
6 |
---
|
7 |
## Model Details
|
8 |
|
9 |
+
This model is an int4 model with group_size 128 and symmetric quantization of [deepseek-ai/DeepSeek-V3](https://huggingface.co/deepseek-ai/DeepSeek-V3) generated by [intel/auto-round](https://github.com/intel/auto-round) algorithm.
|
10 |
|
11 |
**Please note that loading the model in Transformers can be quite slow. Consider using an alternative serving framework for better performance.**
|
12 |
|
|
|
156 |
|
157 |
### Generate the model
|
158 |
|
159 |
+
need 200G GPU memory, details will be updated later
|
160 |
|
161 |
|
162 |
|