Update README.md
README.md
CHANGED
@@ -29,10 +29,11 @@ The feature alignment loss is designed in a way such that the output of `block-x
 The distillation process is performed with `512x512` LAION images recaptioned with `Qwen-VL` in the first stage for `90k steps`,
 and `1024x1024` images generated by `Flux` using the prompts in `JourneyDB` in the second stage for another `90k steps`.
 
-##
+## Limitations
 
-
-
+Given limited computing and data resources, the capability of Flux-mini is still limited in certain domains.
+To facilitate the development of Flux-based models, we have open-sourced the code for distilling Flux at [this link](https://github.com/TencentARC/FluxKits).
+We invite anyone interested in this project to collaborate with us on building a more applicable and powerful text-to-image model!
 
 The current model does reasonably well at generating common images such as human/animal faces, landscapes, and fantasy and abstract scenes.
 Unfortunately, it still performs poorly in many scenarios, including but not limited to:
@@ -47,5 +48,3 @@ Since our model is trained with prompts in JourneyDB, we encourage users to appl
 For example: "profile of sad Socrates, full body, high detail, dramatic scene, Epic dynamic action, wide angle, cinematic, hyper-realistic, concept art, warm muted tones as painted by Bernie Wrightson, Frank Frazetta."
 
 Thank you for your attention! We will continue to improve our model and release new versions in the future.
-
-github link: https://github.com/TencentARC/flux-toolkits