bullerwins
/

DeepSeek-V3-GGUF

Inference Endpoints

Model card Files Files and versions Community

DeepSeek-V3-GGUF / README.md

bullerwins's picture

Update README.md

2484d36 verified 10 days ago

|

338 Bytes

	---
	base_model:
	- deepseek-ai/DeepSeek-V3
	---
	Initial preview for the GGUF quantized version of [deepseek-ai/DeepSeek-V3](https://huggingface.co/deepseek-ai/DeepSeek-V3)

	It needs this PR commit to work: https://github.com/ggerganov/llama.cpp/pull/11049

	Thanks to Fairydreaming for the PR!

	Note: no multi-token prediction (MTP) support