namtran
/

Mistral-7b-v0.2-AWQ-GGUF

Model card Files Files and versions Community

Mistral-7b-v0.2-AWQ-GGUF / README.md

namtran's picture

Update README.md

81e31f7 verified 12 months ago

|

history blame contribute delete

572 Bytes

	---
	inference: false
	license: other
	model_type: llama
	---
	# Mistral 7B v0.2 - AWQ GGUF
	These files are in GGUF format.
	- Model creator: [Mistralai](https://huggingface.co/mistralai)
	- Original model: [Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2)

	The model was converted by the combination of [llama.cpp](https://github.com/ggerganov/llama.cpp) and quantization method [AWQ](https://github.com/mit-han-lab/llm-awq)

	## How to use models in `llama.cpp`

	```
	./main -m Mistral-7b-v0.1-Q2_K.gguf -n 128 --prompt "Once upon a time"
	```