GamerUntouch committed on
Commit
4858e2f
1 Parent(s): cadc080

Update README.md

Files changed (1)
  1. README.md +32 -1
README.md CHANGED
@@ -7,5 +7,36 @@ This is a collection of merged, then converted to 4bit LLaMA models trained on t
 
  UPDATE: 04/04
  Cleaned data and retrained to 32 groupsize and safetensors. Formatting oddities seem to have been wiped out.
-
  Format: Nothing notable. Chapters are separated by ***, which may mess some things up.
+
+ UPDATE: 04/18
+ Retrained and merged using updated LoRAs.
+
+ Tools used to merge and convert:
+ ```
+ transformers 4.28.1
+ gptq triton branch fcf403f
+ llamacpp master branch 5ecff35
+ ```
+
+ Notes for usage:
+ ```
+ - These models are not instruct LoRAs. They are designed to supplement existing story data.
+ - There will likely be some bleedthrough of locations and names, especially if you use the models with very little context.
+ - There isn't any notable formatting beyond ### separating stories in the dataset and *** separating chapters.
+ ```
+
+ Currently transferring models over.
+ ```
+ 7B safetensors 4bit -
+ 7B ggml 4bit -
+
+ 13B safetensors 4bit -
+ 13B ggml 4bit -
+
+ 30B safetensors 4bit -
+ 30B ggml 4bit -
+ ```
+
+
+