Update README.md
README.md CHANGED

````diff
@@ -70,7 +70,7 @@ I have tested CUDA acceleration and it works great. I have not yet tested other
 ## Prompt template: Mistral
 
 ```
-
+[INST] {prompt} [/INST]
 ```
 
 <!-- prompt-template end -->
@@ -171,7 +171,7 @@ Windows Command Line users: You can set the environment variable by running `set
 Make sure you are using `llama.cpp` from commit [d0cee0d](https://github.com/ggerganov/llama.cpp/commit/d0cee0d36d5be95a0d9088b674dbb27354107221) or later.
 
 ```shell
-./main -ngl 35 -m mixtral-8x7b-instruct-v0.1.Q4_K_M.gguf --color -c 32768 --temp 0.7 --repeat_penalty 1.1 -n -1 -p "
+./main -ngl 35 -m mixtral-8x7b-instruct-v0.1.Q4_K_M.gguf --color -c 32768 --temp 0.7 --repeat_penalty 1.1 -n -1 -p "[INST] {prompt} [/INST]"
 ```
 
 Change `-ngl 35` to the number of layers to offload to GPU. Remove it if you don't have GPU acceleration.
````
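To illustrate how the template filled in by this commit is used, here is a minimal shell sketch that wraps an arbitrary instruction in the Mistral `[INST] … [/INST]` markers before handing it to `-p`. The variable names and the example instruction are illustrative only, not part of the README:

```shell
# Wrap a user instruction in the Mistral chat template: [INST] {prompt} [/INST]
prompt="Why is the sky blue?"
formatted=$(printf '[INST] %s [/INST]' "$prompt")
echo "$formatted"
# → [INST] Why is the sky blue? [/INST]

# The formatted string is what the README's llama.cpp command passes via -p, e.g.:
# ./main -ngl 35 -m mixtral-8x7b-instruct-v0.1.Q4_K_M.gguf --color -c 32768 \
#   --temp 0.7 --repeat_penalty 1.1 -n -1 -p "$formatted"
```

Building the string separately like this keeps the quoting simple: the instruction is substituted by `printf`, so any `{prompt}` placeholder handling stays out of the final command line.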