Diffusers
Safetensors
WuerstchenPriorPipeline
dome272 committed · Commit ced6b20 · 1 Parent(s): 77287c4

Update README.md

Files changed (1): README.md (+7 -1)
README.md CHANGED
@@ -19,7 +19,7 @@ inference, its job is to generate the image latents given text. These image late
 ### Image Sizes
 Würstchen was trained on image resolutions between 1024x1024 & 1536x1536. We sometimes also observe good outputs at resolutions like 1024x2048. Feel free to try it out.
 We also observed that the Prior (Stage C) adapts extremely fast to new resolutions. So finetuning it at 2048x2048 should be computationally cheap.
-<img src="https://cdn-uploads.huggingface.co/production/uploads/634cb5eefb80cc6bcaf63c3e/IfVsUDcP15OY-5wyLYKnQ.jpeg" width=1000>
+<img src="https://cdn-uploads.huggingface.co/production/uploads/634cb5eefb80cc6bcaf63c3e/5pA5KUfGmvsObqiIjdGY1.jpeg" width=1000>
 
 ## How to run
 This pipeline should be run together with https://huggingface.co/warp-ai/wuerstchen:
@@ -62,6 +62,12 @@ decoder_output = decoder_pipeline(
 ).images
 ```
 
+### Image Sampling Times
+The figure shows the inference times (on an A100) for different batch sizes (`num_images_per_prompt`) on Würstchen compared to [Stable Diffusion XL](https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0) (without refiner).
+The left figure shows inference times (using torch > 2.0), whereas the right figure applies `torch.compile` to both pipelines in advance.
+![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/634cb5eefb80cc6bcaf63c3e/UPhsIH2f079ZuTA_sLdVe.jpeg)
+
+
 ## Model Details
 - **Developed by:** Pablo Pernias, Dominic Rampas
 - **Model type:** Diffusion-based text-to-image generation model
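
The hunks above reference the prior/decoder "How to run" snippet and the `torch.compile` timing comparison without showing the full code. Below is a minimal sketch of how the two stages might be wired together and optionally compiled, assuming the standard diffusers `WuerstchenPriorPipeline`/`WuerstchenDecoderPipeline` API, the `warp-ai/wuerstchen-prior` repository id for this prior, and the `prior`/`decoder` module attributes; the README's own example may differ.

```python
# Minimal sketch (not the README's exact example): run the prior (Stage C) and
# decoder (Stage B) back to back, optionally compiling both denoising networks
# ahead of time as in the sampling-time figure.
import torch
from diffusers import WuerstchenDecoderPipeline, WuerstchenPriorPipeline
from diffusers.pipelines.wuerstchen import DEFAULT_STAGE_C_TIMESTEPS

device = "cuda"
dtype = torch.float16

# Repository ids assumed here: "warp-ai/wuerstchen-prior" for Stage C,
# "warp-ai/wuerstchen" for the Stage B decoder linked in the README.
prior_pipeline = WuerstchenPriorPipeline.from_pretrained(
    "warp-ai/wuerstchen-prior", torch_dtype=dtype
).to(device)
decoder_pipeline = WuerstchenDecoderPipeline.from_pretrained(
    "warp-ai/wuerstchen", torch_dtype=dtype
).to(device)

# Optional (torch >= 2.0): compile both networks in advance, mirroring the
# right-hand panel of the timing figure. Attribute names are assumptions.
prior_pipeline.prior = torch.compile(prior_pipeline.prior, mode="reduce-overhead")
decoder_pipeline.decoder = torch.compile(decoder_pipeline.decoder, mode="reduce-overhead")

caption = "Anthropomorphic cat dressed as a firefighter"

# Stage C: generate image embeddings (latents) from the text prompt.
prior_output = prior_pipeline(
    prompt=caption,
    height=1024,
    width=1536,
    timesteps=DEFAULT_STAGE_C_TIMESTEPS,
    negative_prompt="",
    guidance_scale=4.0,
    num_images_per_prompt=2,
)

# Stage B: decode the image embeddings into PIL images.
decoder_output = decoder_pipeline(
    image_embeddings=prior_output.image_embeddings,
    prompt=caption,
    negative_prompt="",
    guidance_scale=0.0,
    output_type="pil",
).images
```

Larger values of `num_images_per_prompt` correspond to the batch sizes compared against Stable Diffusion XL in the figure added by this commit.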