Spaces:
Running
Running
fiesty-bear
commited on
Commit
•
95d12a8
1
Parent(s):
22342db
Readme
Browse files
README.md
CHANGED
@@ -1,43 +1,11 @@
|
|
1 |
-
# ClothingGAN: AI Powered Clothing Design Generator
|
2 |
-
[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/mfrashad/ClothingGAN/blob/master/ClothingGAN_Demo.ipynb)
|
3 |
-
[![Hugging Face Spaces](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Spaces-blue)](https://huggingface.co/spaces/mfrashad/ClothingGAN)
|
4 |
|
5 |
-
|
6 |
-
|
7 |
-
|
8 |
-
|
9 |
-
|
10 |
-
|
11 |
-
|
12 |
-
|
13 |
-
|
14 |
-
|
15 |
-
|
16 |
-
## How I built it
|
17 |
-
I trained StyleGAN2-ADA on a subset of the Lookbook dataset. The total images I trained it on are 8,726 clothing images with a clean background. I transfer learned from FFHQ model and trained the model for a day.
|
18 |
-
|
19 |
-
After finished training the GAN, I proceeded to use GANSpace method to find important directions in the latent space. Then, I tried to guess what these directions represent and labeled them accordingly. The reason I use GANSpace is that it is unsupervised and does not need an attribute classifier.
|
20 |
-
|
21 |
-
Finally, I created a UI with Gradio UI library. All the development is done on Colab. Gradio made deployment very easy. I can directly deploy the UI from Colab where Gradio will create a proxy from the Colab server to their domain and the given URL, hence allowing the general public to use the UI or demo. However, since I cannot keep the Colab server on continuously due to GPU usage, ping me if you want to try out the demo.
|
22 |
-
|
23 |
-
|
24 |
-
|
25 |
-
## Challenges I ran into
|
26 |
-
One of the challenges I faced was fixing a memory leak issue. Part of the code keeps crashing, and I initially thought I cannot fit the model to the GPU memory, however, after hours of debugging, I finally found the code that has the memory leak.
|
27 |
-
|
28 |
-
## What I learned
|
29 |
-
I am already quite familiar with GAN but I have always been intimidated on deploying ML models. Luckily, I discovered Gradio UI, a library that makes ML deployment very easy. There were also other alternatives such as StreamLit or Dash, but found Gradio as the easiest to work with. One shortcoming is that it's quite inflexible in terms of customization.
|
30 |
-
|
31 |
-
## What's next for ClothingGAN
|
32 |
-
There is a lot of potential for the project. Some features that can be added are appearance transfer, image inversion (uploading & editing real image), generating the fashion model itself, conditional text input with OpenAI CLIP model, etc.
|
33 |
-
|
34 |
-
|
35 |
-
## Citation
|
36 |
-
```
|
37 |
-
@inproceedings{härkönen2020ganspace,
|
38 |
-
title = {GANSpace: Discovering Interpretable GAN Controls},
|
39 |
-
author = {Erik Härkönen and Aaron Hertzmann and Jaakko Lehtinen and Sylvain Paris},
|
40 |
-
booktitle = {Proc. NeurIPS},
|
41 |
-
year = {2020}
|
42 |
-
}
|
43 |
-
```
|
|
|
|
|
|
|
|
|
1 |
|
2 |
+
---
|
3 |
+
title: {{title}}
|
4 |
+
emoji: {{emoji}}
|
5 |
+
colorFrom: {{colorFrom}}
|
6 |
+
colorTo: {{colorTo}}
|
7 |
+
sdk: {{sdk}}
|
8 |
+
sdk_version: {{sdkVersion}}
|
9 |
+
app_file: app.py
|
10 |
+
pinned: false
|
11 |
+
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|