File size: 2,715 Bytes
7c92b5b
 
913fa6d
0e29863
7c92b5b
913fa6d
 
 
 
7c92b5b
913fa6d
7c92b5b
913fa6d
 
 
7c92b5b
 
913fa6d
7c92b5b
913fa6d
 
 
 
7c92b5b
913fa6d
7c92b5b
59b833c
 
 
 
 
 
 
913fa6d
7c92b5b
913fa6d
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
---
tags:
- not-for-all-audiences
license: apache-2.0
---
<p align="center"><img src="https://cdn-uploads.huggingface.co/production/uploads/633a809fa4a8f33508dce32c/3UZPNZmXJMiZqbSsGvNNX.png"/><font size="7"><b>Aeonis-20b</b></font></p>
<p align="center"><font size="4"><b>Based on Mistral NeMo.</b></font></p>
<p align="center"><font size="4"><b>Trained with Alpaca prompt formatting, Mistral works</b></font></p>
<p align="center"><img src="https://cdn-uploads.huggingface.co/production/uploads/633a809fa4a8f33508dce32c/e9dVPISXzBY6SuGqfjc9L.png"/></p>

-----

<p align="center"><font size="5"> <b>Assistant Examples - 8-bit GGUF</b> </font></p>
<p align="center"><font size="3"> <b>(basic Ooba preset, assistant character, and system prompt)</b> </font></p>
<p align="center"><img src="https://cdn-uploads.huggingface.co/production/uploads/633a809fa4a8f33508dce32c/fjfUfknR0Zu2zxQRMYQiJ.png"/></p>


-----

<p align="center"><font size="5"> <b>NSFW Writing Example - 8-bit GGUF</b> </font></p>
<p align="center"><font size="3"> <b>Prompt: "Write a detailed, erotic story about a stripper sleeping with her co-worker"</b> </font></p>
<p align="center"><font size="3"> <b>(basic Ooba preset, assistant character, and system prompt)</b> </font></p>
<p align="center"><img src="https://cdn-uploads.huggingface.co/production/uploads/633a809fa4a8f33508dce32c/m1mKHxb7RsqYV2UO_1wzX.png"/></p>

-----

<p align="center"><font size="5"> <b>Compantionship Chat Example - 8-bit GGUF</b> </font></p>
<p align="center"><font size="3"> <b>Using Goldie, one of the top characters on Chub.ai</b> </font></p>
<p align="center"><font size="3"> <b>(basic Ooba preset and system prompt)</b> </font></p>
<p align="center"><img src="https://cdn-uploads.huggingface.co/production/uploads/633a809fa4a8f33508dce32c/wNdL-rlbnwZmNBMoH9K8k.png"/></p>

-----

<p align="center"><font size="5"> <b>Training Methodology</b> </font></p>

<p align="center">The model was trained on a variation of TheSkullery/NeMoria-21b, made by finetuning two NeMo models, one for each added “core” (set of repeated layers). One model was overfit to RP data, and the other was overfit to factual data and input analysis. Then the base NeMo was stitched together with the two models, so the repeated portion was one vanilla NeMo core, then the “Virgin” core, then the “Slut” core, a series of layers I like to call the “Whore/Madonna complex”. Now in place, the entire model was continually pretrained on ~1.5 GB private dataset of domain data mixed with stabilizing agents. The Virgin and Slut cores were then each instruct trained on their domains with all other layers frozen, one at a time. Finally, the entire model was SFT’d and DPO’d.