embeddinggemma-300m-qat

Model creator: google
Original model: google/embeddinggemma-300m-qat-q4_0-unquantized
GGUF quantization: provided by olegshulyakov using llama.cpp

Special thanks

🙏 Special thanks to Georgi Gerganov and the whole team working on llama.cpp for making all of this possible.

Usage

ollama run "hf.co/olegshulyakov/embeddinggemma-300m-qat-GGUF:Q4_0"

lms load "olegshulyakov/embeddinggemma-300m-qat-GGUF"

llama-embedding -hf "olegshulyakov/embeddinggemma-300m-qat-GGUF:Q4_0" -p "The meaning to life and the universe is"

llama-server -hf "olegshulyakov/embeddinggemma-300m-qat-GGUF:Q4_0" -c 4096 --embeddings
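
Once the server is running, embeddings can be requested over HTTP. The following is a minimal sketch, not an official client. It assumes llama-server was started with embeddings enabled, listens on its default address (localhost:8080), and exposes the OpenAI-style /v1/embeddings endpoint. The helper names (build_request, embed, cosine_similarity) and the sample sentences are illustrative, not part of this repository.

```python
import json
import math
import urllib.request

# Assumption: llama-server runs locally on its default port with embeddings enabled.
SERVER_URL = "http://localhost:8080/v1/embeddings"


def build_request(texts, url=SERVER_URL):
    """Build a POST request carrying the input texts as OpenAI-style JSON."""
    body = json.dumps({"input": texts}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def embed(texts, url=SERVER_URL):
    """Send texts to the server and return one vector per input, in input order."""
    with urllib.request.urlopen(build_request(texts, url)) as resp:
        payload = json.load(resp)
    # Each item in "data" carries its position in the original input list.
    items = sorted(payload["data"], key=lambda item: item["index"])
    return [item["embedding"] for item in items]


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


if __name__ == "__main__":
    # Hypothetical sample inputs; requires a running server.
    vectors = embed(["How do I bake bread?", "A simple recipe for a sourdough loaf"])
    print(cosine_similarity(vectors[0], vectors[1]))
```

Only the Python standard library is used, so the sketch runs without extra packages. Any OpenAI-compatible client should work as well, provided it is pointed at the same base URL.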

Format: GGUF
Model size: 0.3B params
Architecture: gemma-embedding
Quantization: 4-bit
