Nunya Biz

SpyC0der77

AI & ML interests

None yet

Recent Activity

liked a Space about 4 hours ago
KingNish/Image-Gen-Pro
liked a model about 4 hours ago
madebyollin/taef1
liked a Space about 4 hours ago
ByteDance/Hyper-SDXL-1Step-T2I

Organizations

None yet

SpyC0der77's activity

New activity in KingNish/Realtime-FLUX about 4 hours ago
New activity in AI-Platform/FLUXPro about 4 hours ago

Feature request: Add lora support

#1 opened about 4 hours ago by SpyC0der77
reacted to fdaudens's post with 👍 12 days ago
Is this the best tool to extract clean info from PDFs, handwriting and complex documents yet?

Open source olmOCR just dropped and the results are impressive.

Tested the free demo with various documents, including a handwritten Claes Oldenburg letter. It's fast: 3000 tokens/second on your own GPU. That puts the cost at $190 per million pages, 1/32 that of GPT-4o. Game-changer for content extraction and digital archives.

To achieve this, Ai2 trained a 7B vision language model on 260K pages from 100K PDFs using "document anchoring" - combining PDF metadata with page images.

Best part: it actually understands document structure (columns, tables, equations) instead of just jumbling everything together like most OCR tools. Their human eval results back this up.

👉 Try the demo: https://olmocr.allenai.org

Going right into the AI toolkit: JournalistsonHF/ai-toolkit