thanks to ausboss ❤
Quantized from [ehartford/WizardLM-13B-Uncensored](https://huggingface.co/ehartford/WizardLM-13B-Uncensored) to 4 bits with a group size of 128, using this command:
CUDA_VISIBLE_DEVICES=0 python llama.py ehartford/WizardLM-13B-Uncensored c4 --wbits 4 --true-sequential --groupsize 128 --save_safetensors 4bit-128g.safetensors
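
For readers wondering what `--wbits 4 --groupsize 128` means for the stored weights, here is a rough sketch in plain Python. It assumes the usual reading of those flags: each run of 128 weights shares one scale and one offset, and every weight is stored as a 4-bit code. It uses simple round-to-nearest, which is not what `llama.py` does internally (GPTQ also compensates for rounding error), and the function names are made up for illustration.

```python
import random


def quantize_group(weights, wbits=4):
    """Round-to-nearest quantization of one group (illustrative, not GPTQ)."""
    qmax = 2 ** wbits - 1                      # 15 for 4-bit codes
    lo, hi = min(weights), max(weights)
    scale = (hi - lo) / qmax or 1.0            # fall back to 1.0 for a constant group
    codes = [min(qmax, max(0, round((w - lo) / scale))) for w in weights]
    return codes, scale, lo


def dequantize_group(codes, scale, lo):
    """Map integer codes back to approximate float weights."""
    return [c * scale + lo for c in codes]


def quantize_row(row, wbits=4, groupsize=128):
    """Split a row of weights into groups and quantize each one separately."""
    return [
        quantize_group(row[start:start + groupsize], wbits)
        for start in range(0, len(row), groupsize)
    ]


# Hypothetical data: 512 random values standing in for one row of a weight matrix.
random.seed(0)
row = [random.gauss(0.0, 0.02) for _ in range(512)]

groups = quantize_row(row, wbits=4, groupsize=128)
restored = [w for codes, scale, lo in groups for w in dequantize_group(codes, scale, lo)]
max_err = max(abs(a - b) for a, b in zip(row, restored))

print("groups:", len(groups))
print("max abs error:", max_err)
```

Smaller groups track the local weight range more closely at the cost of storing more scales and offsets, which is presumably the trade-off behind picking 128 here.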