ThomasBaruzier
commited on
Commit
•
f91ea78
1
Parent(s):
b3d175c
Update README.md
Browse files
README.md
CHANGED
@@ -19,6 +19,18 @@ extra_gated_prompt: To access Gemma on Hugging Face, you’re required to review
|
|
19 |
extra_gated_button_content: Acknowledge license
|
20 |
---
|
21 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
22 |
# Gemma Model Card
|
23 |
|
24 |
**Model Page**: [Gemma](https://ai.google.dev/gemma/docs)
|
|
|
19 |
extra_gated_button_content: Acknowledge license
|
20 |
---
|
21 |
|
22 |
+
# Llama.cpp imatrix quantizations of google/gemma-2-2b-it-GGUF
|
23 |
+
|
24 |
+
<img src="https://cdn-uploads.huggingface.co/production/uploads/646410e04bf9122922289dc7/-03oAOPVN1nZjp6-2EIxD.png" alt="gemma" width="60%"/>
|
25 |
+
|
26 |
+
Using llama.cpp commit [268c566](https://github.com/ggerganov/llama.cpp/commit/398ede5efeb07b9adf9fbda7ea63f630d476a792) for quantization.
|
27 |
+
|
28 |
+
Original model: https://huggingface.co/google/gemma-2-2b-it
|
29 |
+
|
30 |
+
All quants were made using the imatrix option and Bartowski's [calibration file](https://gist.github.com/bartowski1182/eb213dccb3571f863da82e99418f81e8).
|
31 |
+
|
32 |
+
<hr><br>
|
33 |
+
|
34 |
# Gemma Model Card
|
35 |
|
36 |
**Model Page**: [Gemma](https://ai.google.dev/gemma/docs)
|