Update README.md
Browse files
README.md
CHANGED
|
@@ -8,11 +8,34 @@ tags:
|
|
| 8 |
|
| 9 |
Wahtastic Merge is a high-quality Stable Diffusion XL (SDXL) model designed to generate stunning images with improved aesthetics and excellent prompt adherence. This model is built upon the robust `noobai-XL-Vpred-1.0` base and has been further refined through the strategic merging of various other models and extensive additional training.
|
| 10 |
|
| 11 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 12 |
|
| 13 |
*(Previously known as Pando Merge)*
|
| 14 |
|
|
|
|
|
|
|
|
|
|
| 15 |
**ETH Wallet Address for Donations:** `0x645BebF82373865eC520d8AC2527524BfB174FF8`
|
|
|
|
| 16 |
|
| 17 |
## How to Use
|
| 18 |
|
|
|
|
| 8 |
|
| 9 |
Wahtastic Merge is a high-quality Stable Diffusion XL (SDXL) model designed to generate stunning images with improved aesthetics and excellent prompt adherence. This model is built upon the robust `noobai-XL-Vpred-1.0` base and has been further refined through the strategic merging of various other models and extensive additional training.
|
| 10 |
|
| 11 |
+
The ultimate goal of this model is to provide an experience very similar to the already fairly competent base of NoobAI v-pred, while fixing up rough edges.
|
| 12 |
+
Many other merges suffer from the bimodality of either having good prompt adherence (closer to base noob) or good default aesthetics (closer to illustrious).
|
| 13 |
+
|
| 14 |
+
Ideally, both can be encapsulated in a model without sacrificing too much model knowledge to acheive this.
|
| 15 |
+
|
| 16 |
+
Up to V7, the model was entirely merged. V8 and above has additional fine-tuning applied atop the model for various fixes.
|
| 17 |
+
|
| 18 |
+
# Wahtastic Roadmap
|
| 19 |
+
- 1536x Super-resolution support
|
| 20 |
+
- Allow for 1536x native generation (and slightly above), akin to Illustrious 2+
|
| 21 |
+
- Fix e6 size tag implications (hyper ≠ huge ≠ big)
|
| 22 |
+
- In short, e6 tags have implications; `hyper_*` implies `huge_*`, and `huge_*` implies `big_*`
|
| 23 |
+
- Because of this, the model leans to associate big with huge, and huge with hyper, causing `big_*` to cause disproportionately large body parts at times.
|
| 24 |
+
- Natural language captioning
|
| 25 |
+
- Yes, CLIP sucks.
|
| 26 |
+
- Using lodestone-rock's natural-language captions, ideally some amount of natural language understanding can be brought back
|
| 27 |
+
- This is inspired by EasyFluff /XL
|
| 28 |
+
- Superior style knowledge
|
| 29 |
+
- ~20k e6 artists with > 500 < 20 posts
|
| 30 |
+
- Potentially danbooru artists too
|
| 31 |
|
| 32 |
*(Previously known as Pando Merge)*
|
| 33 |
|
| 34 |
+
Compute is expensive, and while plenty has been granted to me by kind acquaintances, a fair bit of money has been poured into the training process
|
| 35 |
+
If you like the model, or would like to help me offset the sunken cost of this, please consider donating:
|
| 36 |
+
|
| 37 |
**ETH Wallet Address for Donations:** `0x645BebF82373865eC520d8AC2527524BfB174FF8`
|
| 38 |
+
If you prefer PayPal/Stripe, please contact me on Discord @ velvet.toroyashi
|
| 39 |
|
| 40 |
## How to Use
|
| 41 |
|