VelvetToroyashi
/

WahtasticMerge

Model card Files Files and versions

VelvetToroyashi commited on Jul 9

Commit

f3b2feb

·

verified ·

1 Parent(s): 3c27225

Update README.md

Files changed (1) hide show

README.md +24 -1

README.md CHANGED Viewed

@@ -8,11 +8,34 @@ tags:
 Wahtastic Merge is a high-quality Stable Diffusion XL (SDXL) model designed to generate stunning images with improved aesthetics and excellent prompt adherence. This model is built upon the robust `noobai-XL-Vpred-1.0` base and has been further refined through the strategic merging of various other models and extensive additional training.
-Our focus during development was to enhance the overall visual appeal and ensure that the generated images accurately reflect your creative prompts, giving you more control and consistent results.
 *(Previously known as Pando Merge)*
 **ETH Wallet Address for Donations:** `0x645BebF82373865eC520d8AC2527524BfB174FF8`
 ## How to Use

 Wahtastic Merge is a high-quality Stable Diffusion XL (SDXL) model designed to generate stunning images with improved aesthetics and excellent prompt adherence. This model is built upon the robust `noobai-XL-Vpred-1.0` base and has been further refined through the strategic merging of various other models and extensive additional training.
+The ultimate goal of this model is to provide an experience very similar to the already fairly competent base of NoobAI v-pred, while fixing up rough edges.
+Many other merges suffer from the bimodality of either having good prompt adherence (closer to base noob) or good default aesthetics (closer to illustrious).
+Ideally, both can be encapsulated in a model without sacrificing too much model knowledge to acheive this.
+Up to V7, the model was entirely merged. V8 and above has additional fine-tuning applied atop the model for various fixes.
+# Wahtastic Roadmap
+- 1536x Super-resolution support
+  - Allow for 1536x native generation (and slightly above), akin to Illustrious 2+
+- Fix e6 size tag implications (hyper ≠ huge ≠ big)
+  - In short, e6 tags have implications; `hyper_*` implies `huge_*`, and `huge_*` implies `big_*`
+  - Because of this, the model leans to associate big with huge, and huge with hyper, causing `big_*` to cause disproportionately large body parts at times.
+- Natural language captioning
+  - Yes, CLIP sucks.
+  - Using lodestone-rock's natural-language captions, ideally some amount of natural language understanding can be brought back
+  - This is inspired by EasyFluff /XL
+- Superior style knowledge
+  - ~20k e6 artists with > 500 < 20 posts
+  - Potentially danbooru artists too
 *(Previously known as Pando Merge)*
+Compute is expensive, and while plenty has been granted to me by kind acquaintances, a fair bit of money has been poured into the training process
+If you like the model, or would like to help me offset the sunken cost of this, please consider donating:
 **ETH Wallet Address for Donations:** `0x645BebF82373865eC520d8AC2527524BfB174FF8`
+If you prefer PayPal/Stripe, please contact me on Discord @ velvet.toroyashi
 ## How to Use