OCR
olmocr / nanonets ocr / qwen2vl ocr / aya vision / rolmocr
Comprehensive Demo of Multimodal VLMs on the Hub
camel doc ocr / core ocr / docscope ocr / monkey ocr
deepcaption / skycaptioner / spacethinker / spaceom / coreocr
nanonets ocr / smoldocling / monkey ocr / typhoon ocr
Florence-2-large / Florence-2-base
cosmos reason1 / docscopeocr / visionocr / captioner relaxed
qwen2.5-vl-7b / qwen2.5-vl-3b / abliterated-caption-it / vlr
thinking / ocr / reasoning
ocr / thinking - vlm
Experiment with the Tiny VLMs here