base_model:
- deepseek-ai/DeepSeek-V3
Initial preview of the GGUF quantized version of [deepseek-ai/DeepSeek-V3](https://huggingface.co/deepseek-ai/DeepSeek-V3).
It requires the following llama.cpp PR to work: https://github.com/ggerganov/llama.cpp/pull/11049
Thanks to Fairydreaming for the PR!
Note: multi-token prediction (MTP) is not supported.