Update README.md
README.md
@@ -14,6 +14,8 @@ tags:
 
 Quantized using the [cleaned PIPPA](https://huggingface.co/datasets/royallab/PIPPA-cleaned) roleplay dataset.
 
+- [2.25bpw6h quants](https://huggingface.co/luigi86/magnum-72b-v1-exl2-rpcal/tree/2.25bpw6h) (tested and working on a single RTX 3090 24GiB at 16k context length)
+
 - [2.4bpw6h quants](https://huggingface.co/luigi86/magnum-72b-v1-exl2-rpcal/tree/2.4bpw6h) (may not load on 24GiB VRAM machines -- untested!)
 
 - [3.0bpw8h quants](https://huggingface.co/luigi86/magnum-72b-v1-exl2-rpcal/tree/3.0bpw8h)