Quantized with these parameters:

--bits 4

--group_size 128

--desc_act 1

--damp 0.1

--seqlen 16384

--num_samples 512

Quantization Dataset: Erotiquant XL

Downloads last month
26
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Space using openerotica/Llama-3-lima-nsfw-16k-test-GPTQ 1