lllyasviel/omost-llama-3-8b-4bits

Text Generation · text-generation-inference · 4-bit precision


omost-llama-3-8b-4bits is Omost's Llama 3 8B model with an 8k context length, quantized to 4-bit NF4.
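A minimal loading sketch, assuming the checkpoint can be loaded through the standard `transformers` `from_pretrained` path and that `torch`, `transformers`, `accelerate`, and `bitsandbytes` are installed on a machine with a CUDA GPU. The helper name `load_model` is hypothetical and not part of this model card; only the repository id comes from the page.

```python
# Repository id as shown on this page.
MODEL_ID = "lllyasviel/omost-llama-3-8b-4bits"


def load_model(model_id: str = MODEL_ID):
    """Load the pre-quantized checkpoint and its tokenizer (hypothetical helper)."""
    # Imported lazily so this module can be imported without the heavy dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype=torch.bfloat16,  # assumption: bf16 for the tensors that are not 4-bit
        device_map="auto",           # assumption: accelerate is installed to place weights
    )
    return model, tokenizer
```

Calling `model, tokenizer = load_model()` downloads the weights on first use. No quantization arguments are passed here on the assumption that the 4-bit settings are stored with the checkpoint; if they are not, they would have to be supplied explicitly.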

Downloads last month: 1,537

Safetensors
Model size: 4.65B params
Tensor type: BF16 · F32 · U8


Inference API (serverless) has been turned off for this model.