Update README.md
Browse files
README.md
CHANGED
@@ -21,7 +21,7 @@ The weights are converted to GGML format using [baichuan13b.cpp](https://github.
|
|
21 |
|ggml-model-q4_1.bin | q4_1 | 8.36 GB |
|
22 |
|ggml-model-q5_0.bin | q5_0 | 9.17 GB |
|
23 |
|ggml-model-q5_1.bin | q5_1 | 9.97 GB |
|
24 |
-
|
25 |
|
26 |
## How to inference
|
27 |
1. [Compile baichuan13b](https://github.com/ouwei2013/baichuan13b.cpp#build), a main executable `baichuan13b/build/bin/main` and a server `baichuan13b/build/bin/server` will be generated.
|
|
|
21 |
|ggml-model-q4_1.bin | q4_1 | 8.36 GB |
|
22 |
|ggml-model-q5_0.bin | q5_0 | 9.17 GB |
|
23 |
|ggml-model-q5_1.bin | q5_1 | 9.97 GB |
|
24 |
+
|ggml-model-q8_0.bin | q8_0 | 14 GB |
|
25 |
|
26 |
## How to inference
|
27 |
1. [Compile baichuan13b](https://github.com/ouwei2013/baichuan13b.cpp#build), a main executable `baichuan13b/build/bin/main` and a server `baichuan13b/build/bin/server` will be generated.
|