VelvetToroyashi commited on
Commit
f3b2feb
·
verified ·
1 Parent(s): 3c27225

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +24 -1
README.md CHANGED
@@ -8,11 +8,34 @@ tags:
8
 
9
  Wahtastic Merge is a high-quality Stable Diffusion XL (SDXL) model designed to generate stunning images with improved aesthetics and excellent prompt adherence. This model is built upon the robust `noobai-XL-Vpred-1.0` base and has been further refined through the strategic merging of various other models and extensive additional training.
10
 
11
- Our focus during development was to enhance the overall visual appeal and ensure that the generated images accurately reflect your creative prompts, giving you more control and consistent results.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
12
 
13
  *(Previously known as Pando Merge)*
14
 
 
 
 
15
  **ETH Wallet Address for Donations:** `0x645BebF82373865eC520d8AC2527524BfB174FF8`
 
16
 
17
  ## How to Use
18
 
 
8
 
9
  Wahtastic Merge is a high-quality Stable Diffusion XL (SDXL) model designed to generate stunning images with improved aesthetics and excellent prompt adherence. This model is built upon the robust `noobai-XL-Vpred-1.0` base and has been further refined through the strategic merging of various other models and extensive additional training.
10
 
11
+ The ultimate goal of this model is to provide an experience very similar to the already fairly competent base of NoobAI v-pred, while fixing up rough edges.
12
+ Many other merges suffer from the bimodality of either having good prompt adherence (closer to base noob) or good default aesthetics (closer to illustrious).
13
+
14
+ Ideally, both can be encapsulated in a model without sacrificing too much model knowledge to acheive this.
15
+
16
+ Up to V7, the model was entirely merged. V8 and above has additional fine-tuning applied atop the model for various fixes.
17
+
18
+ # Wahtastic Roadmap
19
+ - 1536x Super-resolution support
20
+ - Allow for 1536x native generation (and slightly above), akin to Illustrious 2+
21
+ - Fix e6 size tag implications (hyper ≠ huge ≠ big)
22
+ - In short, e6 tags have implications; `hyper_*` implies `huge_*`, and `huge_*` implies `big_*`
23
+ - Because of this, the model leans to associate big with huge, and huge with hyper, causing `big_*` to cause disproportionately large body parts at times.
24
+ - Natural language captioning
25
+ - Yes, CLIP sucks.
26
+ - Using lodestone-rock's natural-language captions, ideally some amount of natural language understanding can be brought back
27
+ - This is inspired by EasyFluff /XL
28
+ - Superior style knowledge
29
+ - ~20k e6 artists with > 500 < 20 posts
30
+ - Potentially danbooru artists too
31
 
32
  *(Previously known as Pando Merge)*
33
 
34
+ Compute is expensive, and while plenty has been granted to me by kind acquaintances, a fair bit of money has been poured into the training process
35
+ If you like the model, or would like to help me offset the sunken cost of this, please consider donating:
36
+
37
  **ETH Wallet Address for Donations:** `0x645BebF82373865eC520d8AC2527524BfB174FF8`
38
+ If you prefer PayPal/Stripe, please contact me on Discord @ velvet.toroyashi
39
 
40
  ## How to Use
41