ThomasBaruzier/gemma-2-2b-it-GGUF

ThomasBaruzier committed on Jul 31

Commit f91ea78 • 1 Parent(s): b3d175c

Update README.md

Files changed (1)

README.md +12 -0

@@ -19,6 +19,18 @@ extra_gated_prompt: To access Gemma on Hugging Face, you’re required to review
 extra_gated_button_content: Acknowledge license
 ---
+# Llama.cpp imatrix quantizations of google/gemma-2-2b-it-GGUF
+<img src="https://cdn-uploads.huggingface.co/production/uploads/646410e04bf9122922289dc7/-03oAOPVN1nZjp6-2EIxD.png" alt="gemma" width="60%"/>
+Using llama.cpp commit [268c566](https://github.com/ggerganov/llama.cpp/commit/398ede5efeb07b9adf9fbda7ea63f630d476a792) for quantization.
+Original model: https://huggingface.co/google/gemma-2-2b-it
+All quants were made using the imatrix option and Bartowski's [calibration file](https://gist.github.com/bartowski1182/eb213dccb3571f863da82e99418f81e8).
+<hr><br>
 # Gemma Model Card
 **Model Page**: [Gemma](https://ai.google.dev/gemma/docs)