kingbri committed on
Commit
4fc86c3
1 Parent(s): 0e8b9d9

Create README.md

Files changed (1)
  1. README.md +10 -0
README.md ADDED
@@ -0,0 +1,10 @@
+ ---
+ language:
+ - en
+ ---
+
+ This is a GPTQ-quantized version of [airo-llongma-2-13B-16k](https://huggingface.co/kingbri/airo-llongma-2-13b-16k).
+
+ To run this model, set `compress_pos_emb` to 4 so that the correct RoPE scaling is applied. The `max_ctx_len` is 16384.
+
+ The main branch of this repository holds a 4-bit model with group size 128 (128g) and act order set to false. It was quantized with a sequence length of 4096.
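
The relationship between `compress_pos_emb` and `max_ctx_len` in the README can be sketched in a few lines of Python. This is only an illustration of linear RoPE position compression, not the code of any particular loader. It assumes a native context length of 4096 for the Llama 2 base model, and the helper names `compression_factor` and `compressed_positions` are hypothetical.

```python
# Illustrative sketch only: these helpers are hypothetical, not a loader API.
NATIVE_CTX_LEN = 4096   # assumption: native context length of the Llama 2 base
MAX_CTX_LEN = 16384     # from the README: max_ctx_len


def compression_factor(max_ctx_len: int, native_ctx_len: int) -> int:
    """Return the linear position compression factor (compress_pos_emb)."""
    if max_ctx_len % native_ctx_len != 0:
        raise ValueError("max_ctx_len should be a multiple of native_ctx_len")
    return max_ctx_len // native_ctx_len


def compressed_positions(seq_len: int, factor: int) -> list[float]:
    """Divide each token position by the factor, as linear RoPE scaling does."""
    return [pos / factor for pos in range(seq_len)]


factor = compression_factor(MAX_CTX_LEN, NATIVE_CTX_LEN)
positions = compressed_positions(MAX_CTX_LEN, factor)

print(factor)          # 4
print(positions[-1])   # 4095.75, still inside the native 0..4095 range
```

With a factor of 4, all 16384 positions map back into the range the base model was trained on, which is why the README asks for `compress_pos_emb` to be 4 at this context length.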