GPTQ quantized falcon-rw-1b

Branch	Bits	GS	Act Order	Damp %	GPTQ Dataset	Seq Len	Size	ExLlama	Desc
main	4	None	No	0.01	c4	4096	--	No	4-bit, without Act Order and no grouop size.

Downloads last month: 6

Safetensors

Model size

1.08B params

Tensor type

I32

FP16

Inference Providers NEW

Text Generation

This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.