xiaozaa commited on
Commit
476deea
1 Parent(s): 1a83523

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +90 -3
README.md CHANGED
@@ -1,3 +1,90 @@
1
- ---
2
- license: cc-by-nc-4.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: diffusers
3
+ license: cc-by-nc-2.0
4
+ base_model:
5
+ - black-forest-labs/FLUX.1-Fill-dev
6
+ pipeline_tag: image-to-image
7
+ tags:
8
+ - tryon
9
+ - vto
10
+ ---
11
+
12
+ # Model Card for CAT-Tryoff-Flux
13
+
14
+ CAT-Tryoff-Flux is an advanced tryoff model. It used the same method of (CATVTON-FLUX)[https://huggingface.co/xiaozaa/catvton-flux-alpha]. This model can extract and reconstruct the front view of clothing items from images of people wearing them.
15
+
16
+ ## Model Details
17
+
18
+ ### Model Description
19
+
20
+ <!-- Provide a longer summary of what this model is. -->
21
+
22
+ - **Developed by:** [X/Twitter:Black Magic An](https://x.com/MrsZaaa)
23
+
24
+ ### Model Sources [optional]
25
+
26
+ <!-- Provide the basic links for the model. -->
27
+
28
+ - **Repository:** [github](https://github.com/nftblackmagic/catvton-flux)
29
+
30
+ ## Uses
31
+
32
+ The model is designed for virtual try-off applications, allowing users to visualize how different garments would look on a person. It can be used directly through command-line interface with the following parameters:
33
+
34
+ Input person image
35
+ Person mask
36
+ Garment image
37
+ Random seed (optional)
38
+
39
+ ## How to Get Started with the Model
40
+
41
+ ```
42
+ transformer = FluxTransformer2DModel.from_pretrained(
43
+ "xiaozaa/cat-tryoff-flux",
44
+ torch_dtype=torch.bfloat16
45
+ )
46
+ pipe = FluxFillPipeline.from_pretrained(
47
+ "black-forest-labs/FLUX.1-dev",
48
+ transformer=transformer,
49
+ torch_dtype=torch.bfloat16
50
+ ).to("cuda")
51
+
52
+
53
+
54
+ ```
55
+
56
+ ## Training Details
57
+
58
+ ### Training Data
59
+
60
+ VITON-HD dataset
61
+
62
+ ### Training Procedure
63
+
64
+ Finetuning Flux1-dev-fill
65
+
66
+
67
+ ## Evaluation
68
+
69
+ #### Summary
70
+
71
+
72
+
73
+ **BibTeX:**
74
+ ```
75
+ @misc{chong2024catvtonconcatenationneedvirtual,
76
+ title={CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models},
77
+ author={Zheng Chong and Xiao Dong and Haoxiang Li and Shiyue Zhang and Wenqing Zhang and Xujie Zhang and Hanqing Zhao and Xiaodan Liang},
78
+ year={2024},
79
+ eprint={2407.15886},
80
+ archivePrefix={arXiv},
81
+ primaryClass={cs.CV},
82
+ url={https://arxiv.org/abs/2407.15886},
83
+ }
84
+ @article{lhhuang2024iclora,
85
+ title={In-Context LoRA for Diffusion Transformers},
86
+ author={Huang, Lianghua and Wang, Wei and Wu, Zhi-Fan and Shi, Yupeng and Dou, Huanzhang and Liang, Chen and Feng, Yutong and Liu, Yu and Zhou, Jingren},
87
+ journal={arXiv preprint arxiv:2410.23775},
88
+ year={2024}
89
+ }
90
+ ```