mradermacher
commited on
Commit
•
76a93fb
1
Parent(s):
99c859b
auto-patch README.md
Browse files
README.md
CHANGED
@@ -20,7 +20,7 @@ static quants of https://huggingface.co/elinas/Llama-3-8B-Ultra-Instruct
|
|
20 |
You should use `--override-kv tokenizer.ggml.pre=str:llama3` and a current llama.cpp version to work around a bug in llama.cpp that made these quants. (see https://old.reddit.com/r/LocalLLaMA/comments/1cg0z1i/bpe_pretokenization_support_is_now_merged_llamacpp/?share_id=5dBFB9x0cOJi8vbr-Murh)
|
21 |
|
22 |
<!-- provided-files -->
|
23 |
-
weighted/imatrix quants
|
24 |
## Usage
|
25 |
|
26 |
If you are unsure how to use GGUF files, refer to one of [TheBloke's
|
|
|
20 |
You should use `--override-kv tokenizer.ggml.pre=str:llama3` and a current llama.cpp version to work around a bug in llama.cpp that made these quants. (see https://old.reddit.com/r/LocalLLaMA/comments/1cg0z1i/bpe_pretokenization_support_is_now_merged_llamacpp/?share_id=5dBFB9x0cOJi8vbr-Murh)
|
21 |
|
22 |
<!-- provided-files -->
|
23 |
+
weighted/imatrix quants are available at https://huggingface.co/mradermacher/Llama-3-8B-Ultra-Instruct-i1-GGUF
|
24 |
## Usage
|
25 |
|
26 |
If you are unsure how to use GGUF files, refer to one of [TheBloke's
|