Update README.md
README.md
CHANGED
@@ -37,8 +37,8 @@ This card in particular describes `Photoroom/prx-512-t2i-dc-ae-sft-distilled`, o
 - **Architecture:** PRX (MMDiT-like diffusion transformer variant)
 - **Latent backbone:** [DC-AE VAE](https://arxiv.org/abs/2410.10733)
 - **Text encoder:** T5-Gemma-2B-2B-UL2
-- **Training stage:** Supervised fine-tuning (SFT)
-- **License:** [Apache 2.0](https://www.apache.org/licenses/LICENSE-2.0)
+- **Training stage:** Supervised fine-tuning (SFT) and 8-step distilled
+- **License:** [Apache 2.0](https://www.apache.org/licenses/LICENSE-2.0)
 
 For other checkpoints, browse the full [PRX collection](https://huggingface.co/collections/Photoroom/prx).
 
@@ -55,7 +55,7 @@ pipe = PRXPipeline.from_pretrained(
 ).to("cuda")
 
 prompt = "A front-facing portrait of a lion in the golden savanna at sunset"
-image = pipe(prompt, num_inference_steps=
+image = pipe(prompt, num_inference_steps=8, guidance_scale=5.0).images[0]
 image.save("lion.png")
 ```
 