https://github.com/spydaz
# LeroyDyer/Mixtral_AI_MultiToken-Q4_K_M-GGUF
This model was converted to GGUF format from [`LeroyDyer/Mixtral_AI_MultiToken`](https://huggingface.co/LeroyDyer/Mixtral_AI_MultiToken) using llama.cpp.
Refer to the [original model card](https://huggingface.co/LeroyDyer/Mixtral_AI_MultiToken) for more details on the model.
## Use with llama.cpp
brew install ggerganov/ggerganov/llama.cpp
llama-cli --hf-repo LeroyDyer/Mixtral_AI_MultiToken-Q4_K_M-GGUF --model mixtral_ai_multitoken.Q4_K_M.gguf -p "The meaning to life and the universe is "
llama-server --hf-repo LeroyDyer/Mixtral_AI_MultiToken-Q4_K_M-GGUF --model mixtral_ai_multitoken.Q4_K_M.gguf -c 2048
- Downloads last month
- 306
Inference API (serverless) does not yet support transformers models for this pipeline type.