Khetterman
/

Kosmos-8B-v1-GGUF

Text Generation

Not-For-All-Audiences

Inference Endpoints

Model card Files Files and versions Community

Edit model card

Kosmos-8B-v1 GGUF Quantizations 🗲

The serenity of infinity is not the end.

This model was converted to GGUF format using llama.cpp.

For more information of the model, see the original model card: Khetterman/Kosmos-8B-v1.

Available Quantizations (◕‿◕)

Type	Quantized GGUF Model	Size
Q4_0	Khetterman/Kosmos-8B-v1-Q4_0.gguf	4.34 GiB
Q6_K	Khetterman/Kosmos-8B-v1-Q6_K.gguf	6.14 GiB
Q8_0	Khetterman/Kosmos-8B-v1-Q8_0.gguf	7.95 GiB

My thanks to the authors of the original models, your work is incredible. Have a good time 🖤

Downloads last month: 93

GGUF

Model size

8.03B params

Architecture

llama

4-bit

6-bit

8-bit

Inference Examples

Text Generation

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for Khetterman/Kosmos-8B-v1-GGUF

Base model

Khetterman/Kosmos-8B-v1

Quantized

(2)

this model