Edit model card

Kosmos-8B-v1 GGUF Quantizations πŸ—²

The serenity of infinity is not the end.

KosmosLogo256.png

This model was converted to GGUF format using llama.cpp.

For more information of the model, see the original model card: Khetterman/Kosmos-8B-v1.

Available Quantizations (β—•β€Ώβ—•)

Type Quantized GGUF Model Size
Q4_0 Khetterman/Kosmos-8B-v1-Q4_0.gguf 4.34 GiB
Q6_K Khetterman/Kosmos-8B-v1-Q6_K.gguf 6.14 GiB
Q8_0 Khetterman/Kosmos-8B-v1-Q8_0.gguf 7.95 GiB

My thanks to the authors of the original models, your work is incredible. Have a good time πŸ–€

Downloads last month
93
GGUF
Model size
8.03B params
Architecture
llama

4-bit

6-bit

8-bit

Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for Khetterman/Kosmos-8B-v1-GGUF

Quantized
(2)
this model