AuraFlow-v0.2 / README.md
cloneofsimo's picture
Upload folder using huggingface_hub
ea13150 verified
|
raw
history blame
2.18 kB
---
license: apache-2.0
tags:
- text-to-image
---
# AuraFlow v0.2
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6380ebb8471a4550ff255c62/jMkLXPFVNGdUb7P4nNTqX.png)
AuraFlow v0.2 is the fully open-sourced largest flow-based text-to-image generation model. The model was trained with more compute compared to the previous version, [AuraFlow-v0.1](https://huggingface.co/fal/AuraFlow)
This model achieves state-of-the-art results on GenEval. Read our [blog post](https://blog.fal.ai/auraflow/) for more technical details. You can also check out the comparison with other models on this gallery [page](https://cloneofsimo.github.io/compare_aura_sd3/).
The model is currently in beta. We are working on improving it and the community's feedback is important.
Join [fal's Discord](https://discord.gg/fal-ai) to give us feedback and stay in touch with the model development.
Credits: A huge thank you to [@cloneofsimo](https://twitter.com/cloneofsimo) and [@isidentical](https://twitter.com/isidentical) for bringing this project to life. It's incredible what two cracked engineers can achieve in
such a short period of time. We also extend our gratitude to the incredible researchers whose prior work laid the foundation for our efforts.
## Usage
```bash
$ pip install transformers accelerate protobuf sentencepiece
$ pip install git+https://github.com/huggingface/diffusers.git
```
```python
from diffusers import AuraFlowPipeline
import torch
pipeline = AuraFlowPipeline.from_pretrained(
"fal/AuraFlow-v0.2",
torch_dtype=torch.float16,
variant="fp16",
).to("cuda")
image = pipeline(
prompt="close-up portrait of a majestic iguana with vibrant blue-green scales, piercing amber eyes, and orange spiky crest. Intricate textures and details visible on scaly skin. Wrapped in dark hood, giving regal appearance. Dramatic lighting against black background. Hyper-realistic, high-resolution image showcasing the reptile's expressive features and coloration.",
height=1024,
width=1024,
num_inference_steps=50,
generator=torch.Generator().manual_seed(666),
guidance_scale=3.5,
).images[0]
image.save("output.png")
```