R136a1 committed
Commit fd51c67
1 Parent(s): 59f3877

Create README.md

Files changed (1)
  1. README.md +32 -0
README.md ADDED
@@ -0,0 +1,32 @@
+ ---
+ tags:
+ - not-for-all-audiences
+ - nsfw
+ license: other
+ language:
+ - en
+ ---
+
+ [EXL2](https://github.com/turboderp/exllamav2/tree/master#exllamav2) quantization of [Undi95's MXLewd-L2-20B](https://huggingface.co/Undi95/MXLewd-L2-20B).
+
+
+ ## Model details
+
+ First attempt at quantizing a 20B model so that it runs in 16 GB of VRAM at the highest quality possible.
+ Quantized at 3.23 bpw (bits per weight) with `hb 6` (6 head bits).
+ Perplexity:
+ - Base = 6.4744
+ - 3.23 bpw, h6 = 6.5369
+ - Dataset = [wikitext](https://huggingface.co/datasets/wikitext/resolve/refs%2Fconvert%2Fparquet/wikitext-2-v1/test/0000.parquet)
+
+ ## Prompt Format
+
+ ```
+ Below is an instruction that describes a task. Write a response that appropriately completes the request.
+
+ ### Instruction:
+ {prompt}
+
+ ### Response:
+
+ ```
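
The prompt format in the README can be filled in programmatically. The following is a minimal sketch, not part of the commit: `build_prompt` and `PROMPT_TEMPLATE` are hypothetical names, and ending the prompt with a single newline after `### Response:` is an assumption about how the template's trailing blank line should be handled.

```python
# Template text copied from the README's Prompt Format section.
# Assumption: the prompt ends with one newline after "### Response:".
PROMPT_TEMPLATE = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n"
    "\n"
    "### Instruction:\n"
    "{prompt}\n"
    "\n"
    "### Response:\n"
)


def build_prompt(instruction: str) -> str:
    """Hypothetical helper: insert an instruction into the template."""
    # str.replace is used instead of str.format so that any braces in the
    # instruction text are passed through unchanged.
    return PROMPT_TEMPLATE.replace("{prompt}", instruction.strip())


if __name__ == "__main__":
    print(build_prompt("Summarize the plot of Hamlet in two sentences."))
```

The returned string would be passed as the raw prompt to whichever EXL2-capable loader is used; the model's reply is expected to follow the `### Response:` line.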