Undi95 commited on
Commit
9d6e34f
1 Parent(s): da6c08c

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +59 -0
README.md ADDED
@@ -0,0 +1,59 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: cc-by-nc-4.0
3
+ tags:
4
+ - not-for-all-audiences
5
+ - nsfw
6
+ ---
7
+
8
+ ## Borealis
9
+
10
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/63ab1241ad514ca8d1430003/ieXwQCpyL7AX659Z7ivxA.png)
11
+
12
+ Borealis-10.7B-DPO is a 10.7B model made of 48 Mistral 7B layers, finetuned for +70h on 2xA6000 on a big RP and Conversational dataset with llama2 configuration of Axolotl, like SOLAR.
13
+
14
+ This variant had a DPO train on top of it.
15
+
16
+ <!-- description start -->
17
+ ## Description
18
+
19
+ This repo contains fp16 files of Borealis-10.7B-DPO, a conversational model.
20
+
21
+ The goal of this model isn't to break all benchmark, but to have a better RP/ERP/Conversational model.
22
+
23
+ It was trained on multiple basic dataset to make it intelligent, but majority of the dataset was basic conversations.
24
+
25
+ <!-- description end -->
26
+ <!-- description start -->
27
+ ## Dataset used
28
+
29
+ - NobodyExistsOnTheInternet/ToxicQAFinal
30
+ - teknium/openhermes
31
+ - unalignment/spicy-3.1
32
+ - Doctor-Shotgun/no-robots-sharegpt
33
+ - Undi95/toxic-dpo-v0.1-sharegpt
34
+ - Aesir [1], [2], [3-SFW], [3-NSFW]
35
+ - lemonilia/LimaRP
36
+ - Squish42/bluemoon-fandom-1-1-rp-cleaned
37
+ - Undi95/ConversationChronicles-sharegpt-SHARDED (2 sets, modified)
38
+
39
+ ## DPO Dataset used
40
+
41
+ - Intel/orca_dpo_pairs
42
+ - NobodyExistsOnTheInternet/ToxicDPOqa
43
+ - Undi95/toxic-dpo-v0.1-NoWarning
44
+
45
+ <!-- description end -->
46
+ <!-- prompt-template start -->
47
+ ## Prompt format: NsChatml
48
+ ```
49
+ <|im_system|>
50
+ {sysprompt}<|im_end|>
51
+ <|im_user|>
52
+ {input}<|im_end|>
53
+ <|im_bot|>
54
+ {output}<|im_end|>
55
+ ```
56
+
57
+ ## Others
58
+
59
+ If you want to support me, you can [here](https://ko-fi.com/undiai).