bullerwins
/

DeepSeek-V3-GGUF

Inference Endpoints

Model card Files Files and versions Community

DeepSeek-V3-GGUF / README.md

bullerwins's picture

Update README.md

ca7e68f verified 10 days ago

|

544 Bytes

	---
	base_model:
	- deepseek-ai/DeepSeek-V3
	---

	UPDATE Jan 4th 2025: Support for DeepSeek-V3 has been merged, you can now pull from the master branch. The versions uploaded in the repo are already requanted to support the changes in the tensor names


	Initial preview for the GGUF quantized version of [deepseek-ai/DeepSeek-V3](https://huggingface.co/deepseek-ai/DeepSeek-V3)

	It needs this PR commit to work: https://github.com/ggerganov/llama.cpp/pull/11049

	Thanks to Fairydreaming for the PR!

	Note: no multi-token prediction (MTP) support