---
license: mit
language:
- en
base_model:
- stable-diffusion-v1-5/stable-diffusion-v1-5
tags:
- HTG
- stable-diffusion
- handwritten-text-generation
metrics:
- cer
pipeline_tag: text-to-image
---

# DiffusionPen: Towards Controlling the Style of Handwritten Text Generation

Paper - *DiffusionPen: Towards Controlling the Style of Handwritten Text Generation (ECCV 2024)*

Git repo: https://github.com/koninik/DiffusionPen

**Model Description**

This release includes pretrained models for the DiffusionPen method. The repo includes:

- The IAM pre-processed dataset in `.pt` format for direct loading, in `saved_iam_data`
- Style weights for the style encoder (also DiffusionPen-class and DiffusionPen-triplet), in `style_models`
- DiffusionPen weights for IAM, in `diffusionpen_iam_model_path/models`

For the VAE and DDIM, we use stable-diffusion-v1-5/stable-diffusion-v1-5.

More information on how to use the weights and data files can be found in the Git repo.

**ArXiv**

Nikolaidou, K., Retsinas, G., Sfikas, G. and Liwicki, M., 2024. DiffusionPen: Towards Controlling the Style of Handwritten Text Generation. arXiv preprint arXiv:2409.06065.
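
**Loading the VAE and DDIM components (sketch)**

The model description states that the VAE and DDIM come from stable-diffusion-v1-5/stable-diffusion-v1-5. The following is a minimal sketch of how those two components might be fetched with the `diffusers` library. It is not taken from the DiffusionPen repository: the helper names `component_specs` and `load_components` are hypothetical, and the `vae` and `scheduler` subfolder names are assumed from the standard Stable Diffusion v1.5 repository layout. Loading the DiffusionPen and style-encoder weights themselves is covered in the Git repo and is not shown here.

```python
# Base model named in this card's metadata.
BASE_MODEL = "stable-diffusion-v1-5/stable-diffusion-v1-5"


def component_specs(base_model: str = BASE_MODEL) -> dict:
    """Map each component to the repo and subfolder it is loaded from.

    Hypothetical helper: the subfolder names are an assumption based on
    the usual Stable Diffusion v1.5 layout.
    """
    return {
        "vae": {"repo": base_model, "subfolder": "vae"},
        "scheduler": {"repo": base_model, "subfolder": "scheduler"},
    }


def load_components(base_model: str = BASE_MODEL):
    """Download (or read from the local cache) the VAE and a DDIM scheduler."""
    # Imported here so the module can be imported without diffusers installed.
    from diffusers import AutoencoderKL, DDIMScheduler

    specs = component_specs(base_model)
    vae = AutoencoderKL.from_pretrained(
        specs["vae"]["repo"], subfolder=specs["vae"]["subfolder"]
    )
    scheduler = DDIMScheduler.from_pretrained(
        specs["scheduler"]["repo"], subfolder=specs["scheduler"]["subfolder"]
    )
    return vae, scheduler
```

Calling `vae, scheduler = load_components()` requires network access (or a populated local cache) on first use. Whether the authors' training and sampling scripts configure these components in the same way should be checked against the Git repo.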