TeamFnord/manga-ocr

kha-white committed on Jan 20, 2022

Commit 76ff029 • 1 Parent(s): 0808c80

Create README.md

Files changed (1)

README.md +26 -0

	@@ -0,0 +1,26 @@

+---
+language: ja
+tags:
+- image-to-text
+license: apache-2.0
+datasets:
+- manga109s
+---
+# Manga OCR
+Optical character recognition for Japanese text, with the main focus being Japanese manga.
+It uses the [Vision Encoder Decoder](https://huggingface.co/docs/transformers/model_doc/visionencoderdecoder) framework.
+Manga OCR can be used as a general-purpose OCR for printed Japanese text, but its main goal was to provide high-quality
+text recognition that is robust against various scenarios specific to manga:
+- both vertical and horizontal text
+- text with furigana
+- text overlaid on images
+- wide variety of fonts and font styles
+- low-quality images
+Code for inference is available [here](https://github.com/kha-white/manga_ocr).
+Code for training will be released soon.
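
Reviewer note (not part of the commit): since the README says the model uses the Vision Encoder Decoder framework, inference can probably be sketched with the generic `transformers` classes as below. This is a minimal sketch under assumptions, not the code from the linked inference repository. The function names `recognize` and `join_tokens`, the `max_length` value, and the whitespace-stripping step are illustrative choices; `model_id` must be supplied by the caller, and the tokenizer may need extra Japanese tokenization packages installed.

```python
def join_tokens(decoded: str) -> str:
    """Remove the whitespace a subword tokenizer may leave between decoded tokens.

    Assumption: Japanese output needs no spaces, so all whitespace is dropped.
    """
    return "".join(decoded.split())


def recognize(image_path: str, model_id: str, max_length: int = 300) -> str:
    """Run OCR on one image with a Vision Encoder Decoder checkpoint.

    model_id is the Hub repository ID or local path of the checkpoint;
    it is deliberately left without a default here.
    """
    # Imported lazily so the helper above stays usable without these packages.
    from PIL import Image
    from transformers import (
        AutoImageProcessor,
        AutoTokenizer,
        VisionEncoderDecoderModel,
    )

    processor = AutoImageProcessor.from_pretrained(model_id)
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = VisionEncoderDecoderModel.from_pretrained(model_id)

    # The image encoder expects a 3-channel input.
    image = Image.open(image_path).convert("RGB")
    pixel_values = processor(image, return_tensors="pt").pixel_values

    # Autoregressively decode token IDs from the encoded image.
    token_ids = model.generate(pixel_values, max_length=max_length)
    decoded = tokenizer.decode(token_ids[0], skip_special_tokens=True)
    return join_tokens(decoded)
```

Calling `recognize("page_crop.png", model_id=...)` with a cropped text region and this model's repository ID would return the recognized string; the file name here is hypothetical.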