readme: TODO implement llamafile
README.md

Quantized from [https://huggingface.co/deepseek-ai/DeepSeek-V2-Chat](https://huggingface.co/deepseek-ai/DeepSeek-V2-Chat)
Using llama.cpp fork: [https://github.com/fairydreaming/llama.cpp/tree/deepseek-v2](https://github.com/fairydreaming/llama.cpp/tree/deepseek-v2)
TODO: Make llamafile for Q2_K and Q4_K_M
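For reference, packaging a GGUF quant as a llamafile typically follows the flow from Mozilla's llamafile README. A sketch only; the filenames below are placeholders, and the Q4_K_M file would be packaged the same way:

```shell
# Sketch of the usual llamafile packaging flow (per Mozilla's llamafile docs).
# Filenames are placeholders for whatever this repo ends up shipping.
cp "$(command -v llamafile)" deepseek-v2-chat.Q2_K.llamafile

# .args holds the default CLI arguments; the trailing "..." line lets
# users append their own arguments at run time.
cat > .args <<'EOF'
-m
deepseek-v2-chat.Q2_K.gguf
...
EOF

# zipalign here is the tool shipped with llamafile, not Android's zipalign.
zipalign -j0 deepseek-v2-chat.Q2_K.llamafile deepseek-v2-chat.Q2_K.gguf .args
```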
# Warning: This will not work unless you compile llama.cpp from the repo provided (and set metadata KV overrides)!
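A rough sketch of building the fork and launching with metadata KV overrides. The override key/value below is a placeholder, not the real one; use the actual overrides documented for this model:

```shell
# Sketch only: build the DeepSeek-V2 branch of the fork, then run with
# metadata KV overrides.
git clone --branch deepseek-v2 https://github.com/fairydreaming/llama.cpp
cd llama.cpp
make -j"$(nproc)"

# --override-kv takes KEY=TYPE:VALUE (types: int, float, bool, str).
# "some.metadata.key=int:0" is a placeholder, not a real override.
./main -m ../deepseek-v2-chat.Q2_K.gguf \
  --override-kv some.metadata.key=int:0 \
  -p "Hello"
```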
# How to use: