Joseph717171
/

Imatrices

Model card Files Files and versions Community

Joseph717171 commited on Mar 20, 2024

Commit

92e895f

·

verified ·

1 Parent(s): 803a4b9

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -126,7 +126,7 @@ cd <llama.cpp directory>
 ```
 4. Use the generated matrix file to quantise the model
 ```
-./quantize --matrix <output.matrix> <model_path>/ggml-model-f16.gguf <quantisation_level, eg:IQ4_XS>
 ```
 Note: normal quantisation also benefits from using a matrix file. It also seem that a bigger input matrix is
 better for higher quantisation.

 ```
 4. Use the generated matrix file to quantise the model
 ```
+./quantize --imatrix <output.matrix> <model_path>/ggml-model-f16.gguf <quantisation_level, eg:IQ4_XS>
 ```
 Note: normal quantisation also benefits from using a matrix file. It also seem that a bigger input matrix is
 better for higher quantisation.