--- datasets: - cQueenccc/Vivian-Blip-Captions language: - en pipeline_tag: text-to-image --- # Disclaimer This was inspired from https://github.com/YaYaB/finetune-diffusion # Model Card for Finetuning Stable Diffusion on Vivian Maier's photographs The main goal is to fine-tune the Stable Diffusion model to generate images reflecting the distinct photographic style of Vivian Maier. And I chose to utilize a Jupyter Notebook to make the fine-tuning process accessible and easy to understand, particularly for those new to the diffusion pipeline and hugging face API. # Requirements To launch the finetuning with a batch_size of 1 you need to have a gpu with at least 24G VRAM (you can use accumulating gradient to simulate higher batch size) Make sure that you have enough disk space, the model uses ~11Gb ## Examples(at epoch 90) ![vv1.jpg](https://huggingface.co/cQueenccc/Fine-Tune-Diffusion-Vivian/resolve/main/eval/A%20woman%20walking%20down%20the%20street/A%20woman%20walking%20down%20the%20street_90_000000.png) > A woman walking down a street ![vv2.jpg](https://huggingface.co/cQueenccc/Fine-Tune-Diffusion-Vivian/resolve/main/eval/a%20group%20of%20people%20getting%20on%20a%20bus/a%20group%20of%20people%20getting%20on%20a%20bus_90_000000.png) > a group of people getting on a bus ![vv3.jpg](https://huggingface.co/cQueenccc/Fine-Tune-Diffusion-Vivian/resolve/main/eval/two%20men%20working%20on%20a%20construction%20site/two%20men%20working%20on%20a%20construction%20site_90_000000.png) > two man working on a constructing site ## Citation If you use this dataset, please cite it as: ``` @misc{cqueenccc2023vivian, author = {cQueenccc}, title = {Finetuning Stable Diffusion on Vivian Maier's photographs}, year={2023}, howpublished= {\url{https://huggingface.co/cQueenccc/Fine-Tune-Diffusion-Vivian/}} } ```