Text-to-Image
daoyuan98 commited on
Commit
17adac7
1 Parent(s): 9612229

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -5
README.md CHANGED
@@ -29,10 +29,11 @@ The feature alignment loss is designed in a way such that the output of `block-x
29
  The distillation process is performed with `512x512` Laion images recaptioned with `Qwen-VL` in the first stage for `90k steps`,
30
  and `1024x1024` images generated by `Flux` using the prompts in `JourneyDB` with another `90k steps`.
31
 
32
- ## Disclaimer
33
 
34
- Compared to other text-to-image models, our model was trained with limited computing resources and synthetic data with mediocre quality.
35
- Thus, the generation capability of our model is still limited in certain categories.
 
36
 
37
  The current model is ok with generating common images such as human/animal faces, landscapes, fantasy and abstract scenes.
38
  Unfortunately, it is still incompetent in many scenarios. Including but not limited to:
@@ -47,5 +48,3 @@ Since our model is trained with prompts in JourneyDB, we encourage users to appl
47
  For example: "profile of sad Socrates, full body, high detail, dramatic scene, Epic dynamic action, wide angle, cinematic, hyper-realistic, concept art, warm muted tones as painted by Bernie Wrightson, Frank Frazetta."
48
 
49
  Thank you for your attention! We will continue to improve our model and release new versions in the future.
50
-
51
- github link: https://github.com/TencentARC/flux-toolkits
 
29
  The distillation process is performed with `512x512` Laion images recaptioned with `Qwen-VL` in the first stage for `90k steps`,
30
  and `1024x1024` images generated by `Flux` using the prompts in `JourneyDB` with another `90k steps`.
31
 
32
+ ## Limitations
33
 
34
+ With limited computing and data resources, the capability of our Flux-mini is still limited in certain domains.
35
+ To facilitate the development of flux-based models, we open-sourced the codes to distill Flux in [this link](https://github.com/TencentARC/FluxKits).
36
+ We appeal people interested in this project to collaborate together to build a more applicable and powerful text-to-image model!
37
 
38
  The current model is ok with generating common images such as human/animal faces, landscapes, fantasy and abstract scenes.
39
  Unfortunately, it is still incompetent in many scenarios. Including but not limited to:
 
48
  For example: "profile of sad Socrates, full body, high detail, dramatic scene, Epic dynamic action, wide angle, cinematic, hyper-realistic, concept art, warm muted tones as painted by Bernie Wrightson, Frank Frazetta."
49
 
50
  Thank you for your attention! We will continue to improve our model and release new versions in the future.