---
license: mit
---
This is an exl2-format model.
### Yi-34b-200K-alpaca-rpv3-4bpw-hb6-exl2
- base model: [Yi-34B-200K](https://huggingface.co/01-ai/Yi-34B-200K)
- LoRA: [Yi-34b-alpaca-cot-lora](https://huggingface.co/zzlgreat/Yi-34b-alpaca-cot-lora)
- LoRA: [limarpv3-yi-llama-34b-lora](https://huggingface.co/Doctor-Shotgun/limarpv3-yi-llama-34b-lora)
### Description
- This is a test quantization for [exllamav2](https://github.com/turboderp/exllamav2)
- quantized to 4.15 bpw with 6-bit head weights: `python convert.py -i Yi-34b-200K-alpaca-rpv3 -c exl2/0000.parquet -o Yi-34b-200K-alpaca-rpv3-4bpw-hb6-exl2 -hb 6 -l 4096 -b 4.15` (see the loading sketch after this list)
- conversion guide: [convert.md](https://github.com/turboderp/exllamav2/blob/master/doc/convert.md)
- calibration dataset: [WikiText-2-v1](https://huggingface.co/datasets/wikitext/blob/refs%2Fconvert%2Fparquet/wikitext-2-v1/test/0000.parquet)
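
For reference, a minimal loading and inference sketch using exllamav2's Python API, following the upstream example scripts; class names may shift between exllamav2 releases, and the prompt and sampling values are placeholders:

```python
# Minimal sketch, assuming exllamav2's example API (ExLlamaV2BaseGenerator etc.);
# adjust model_dir to wherever this repo is downloaded locally.
from exllamav2 import ExLlamaV2, ExLlamaV2Config, ExLlamaV2Cache, ExLlamaV2Tokenizer
from exllamav2.generator import ExLlamaV2BaseGenerator, ExLlamaV2Sampler

config = ExLlamaV2Config()
config.model_dir = "Yi-34b-200K-alpaca-rpv3-4bpw-hb6-exl2"  # local path to this repo
config.prepare()

model = ExLlamaV2(config)
cache = ExLlamaV2Cache(model, lazy=True)   # allocate the cache as weights load
model.load_autosplit(cache)                # split layers across available GPUs

tokenizer = ExLlamaV2Tokenizer(config)
generator = ExLlamaV2BaseGenerator(model, cache, tokenizer)

settings = ExLlamaV2Sampler.Settings()
settings.temperature = 0.8                 # placeholder sampling values
settings.top_p = 0.9

print(generator.generate_simple("Hello, my name is", settings, num_tokens=100))
```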