wenqiglantz/Mistral-7B-v0.1-GGUF


wenqiglantz committed on Jan 15

Commit e765e2b • 1 parent: b57110e

Upload 2 files

Files changed (2)

README.md +27 -0
config.json +3 -0

README.md ADDED

	@@ -0,0 +1,27 @@

+---
+license: apache-2.0
+pipeline_tag: text-generation
+tags:
+- finetuned
+inference: true
+base_model: mistralai/Mistral-7B-Instruct-v0.2
+model_creator: Mistral AI
+model_name: Mistral 7B Instruct v0.2
+model_type: mistral
+prompt_template: '<s>[INST] {prompt} [/INST]
+  '
+quantized_by: wenqiglantz
+---
+# Mistral 7B Instruct v0.2 - GGUF
+This is a quantized version of `mistralai/Mistral-7B-Instruct-v0.2`. Two quantization methods were used:
+- Q5_K_M: 5-bit, preserves most of the model's performance
+- Q4_K_M: 4-bit, smaller footprint that uses less memory
+<!-- description start -->
+## Description
+This repo contains GGUF format model files for [Mistral AI's Mistral 7B Instruct v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2).
+This model was quantized in Google Colab.
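
Reviewer note (not part of the commit): the `prompt_template` field in the front matter above describes how a user message is wrapped before it is sent to the model. A minimal sketch of applying it is shown below. The constant `PROMPT_TEMPLATE` and the helper `build_prompt` are hypothetical names introduced only for illustration, and the template's trailing whitespace is dropped for simplicity. Some runtimes may prepend the `<s>` token on their own, in which case it would probably need to be removed from the template to avoid duplicating it.

```python
# Template text copied from the `prompt_template` front-matter field in README.md.
# Assumption: the trailing whitespace of the YAML value is omitted here.
PROMPT_TEMPLATE = "<s>[INST] {prompt} [/INST]"


def build_prompt(user_message: str) -> str:
    """Wrap a single user message in the instruction markers from the template."""
    return PROMPT_TEMPLATE.format(prompt=user_message)


if __name__ == "__main__":
    print(build_prompt("Summarize what GGUF quantization does."))
```

The resulting string is what would be passed as the raw prompt to whichever GGUF-capable runtime loads the Q4_K_M or Q5_K_M file.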

config.json ADDED

	@@ -0,0 +1,3 @@

+{
+    "model_type": "mistral"
+}