Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,39 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: apache-2.0
|
3 |
+
base_model:
|
4 |
+
- mistralai/Mistral-7B-Instruct-v0.3
|
5 |
+
---
|
6 |
+
|
7 |
+
|
8 |
+
# mistralai/Mistral-7B-Instruct-v0.3
|
9 |
+
- ## Introduction
|
10 |
+
- Quantization Tool: Quark 0.6.0
|
11 |
+
- OGA Model Builder: v0.5.1
|
12 |
+
- ## Quantization Strategy
|
13 |
+
- AWQ / Group 128 / Asymmetric / UINT4 Weights / FP16 activations
|
14 |
+
- Excluded Layers: None
|
15 |
+
```
|
16 |
+
python3 quantize_quark.py \
|
17 |
+
--model_dir "$model" \
|
18 |
+
--output_dir "$output_dir" \
|
19 |
+
--quant_scheme w_uint4_per_group_asym \
|
20 |
+
--num_calib_data 128 \
|
21 |
+
--quant_algo awq \
|
22 |
+
--dataset pileval_for_awq_benchmark \
|
23 |
+
--seq_len 512 \
|
24 |
+
--model_export quark_safetensors \
|
25 |
+
--data_type float16 \
|
26 |
+
--exclude_layers [] \
|
27 |
+
--custom_mode awq
|
28 |
+
```
|
29 |
+
- ## OGA Model Builder
|
30 |
+
```
|
31 |
+
python builder.py \
|
32 |
+
-i <quantized safetensor model dir> \
|
33 |
+
-o <oga model output dir> \
|
34 |
+
-p int4 \
|
35 |
+
-e dml
|
36 |
+
```
|
37 |
+
- PostProcessed to generate Hybrid Model
|
38 |
+
-
|
39 |
+
|