bdsqlsz
/

stable-diffusion-xl-base-1.0_fixvae_fp16

stable-diffusion

Model card Files Files and versions Community

bdsqlsz commited on Jul 27, 2023

Commit

38c3c49

•

1 Parent(s): 2650324

Update README.md

Files changed (1) hide show

README.md +39 -0

README.md CHANGED Viewed

@@ -1,3 +1,42 @@
 ---
 license: openrail++
 ---

 ---
 license: openrail++
+tags:
+- stable-diffusion
+inference: false
 ---
+this is merge model for
+100% stable-diffusion-xl-base-1.0
+https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0
++
+sdxl-vae-fp16-fix
+https://huggingface.co/madebyollin/sdxl-vae-fp16-fix
+you can use this directly or finetune.
+same license on stable-diffusion-xl-base-1.0
+same vae license on sdxl-vae-fp16-fix
+# SDXL-VAE-FP16-Fix
+SDXL-VAE-FP16-Fix is the [SDXL VAE](https://huggingface.co/stabilityai/sdxl-vae)*, but modified to run in fp16 precision without generating NaNs.
+| VAE                   | Decoding in `float32` / `bfloat16` precision | Decoding in `float16` precision |
+| --------------------- | -------------------------------------------- | ------------------------------- |
+| SDXL-VAE              | ✅ ![](./images/orig-fp32.png)              | ⚠️ ![](./images/orig-fp16.png)  |
+| SDXL-VAE-FP16-Fix     | ✅ ![](./images/fix-fp32.png)               | ✅ ![](./images/fix-fp16.png)   |
+## Details
+SDXL-VAE generates NaNs in fp16 because the internal activation values are too big:
+![](./images/activation-magnitudes.jpg)
+SDXL-VAE-FP16-Fix was created by finetuning the SDXL-VAE to:
+1. keep the final output the same, but
+2. make the internal activation values smaller, by
+3. scaling down weights and biases within the network
+There are slight discrepancies between the output of SDXL-VAE-FP16-Fix and SDXL-VAE, but the decoded images should be close enough for most purposes.
+---