Update README.md
Browse files
README.md
CHANGED
@@ -34,6 +34,7 @@ The following specifications:
|
|
34 |
| Model | Quantized | Size | Context | Hardware Requirement |
|
35 |
|-------------|-----------|--------|--------------------------| --------------------------|
|
36 |
| APUS-xDAN4.0-MoE-0402.Q2_K.gguf | Q2_K | 39G | 32k | 2x24G GPU memory |
|
|
|
37 |
| APUS-xDAN4.0-MoE-0402.Q3_K_M_Matrix.gguf | Q3_K_M | 51G | 32k | 2x24G GPU memory |
|
38 |
| APUS-xDAN4.0-MoE-0402.Q4_K_M.gguf | Q4_K_M | 64G | 32k | 3x24G GPU memory |
|
39 |
| | | | | 4x80G GPU memory |
|
|
|
34 |
| Model | Quantized | Size | Context | Hardware Requirement |
|
35 |
|-------------|-----------|--------|--------------------------| --------------------------|
|
36 |
| APUS-xDAN4.0-MoE-0402.Q2_K.gguf | Q2_K | 39G | 32k | 2x24G GPU memory |
|
37 |
+
| APUS-xDAN4.0-MoE-0402.IQ3_XXS.gguf | IQ3_XXS | 41G | 32k | 2x24G GPU memory |
|
38 |
| APUS-xDAN4.0-MoE-0402.Q3_K_M_Matrix.gguf | Q3_K_M | 51G | 32k | 2x24G GPU memory |
|
39 |
| APUS-xDAN4.0-MoE-0402.Q4_K_M.gguf | Q4_K_M | 64G | 32k | 3x24G GPU memory |
|
40 |
| | | | | 4x80G GPU memory |
|