which vision model is R1 using for text extraction from image or pdfs.

#127

by ashutoshroy02 - opened 1 day ago

Discussion

ashutoshroy02

1 day ago

•

edited 1 day ago

i want to do ocr on a Archaeological book pdf (scanned book) . and extract text in markdown preceisly and very very accurately to make my RAG model super precise . i have tried lama-parser, docling , gemini, tessarct . i want to try deepseek for doing that

Reality123b

about 6 hours ago

i genuinely dont know how you will be able to do that. but deepseek is a company. Also, you may/can do that in deepseek chat if your book is small enough

oliszymanski

about 3 hours ago

i genuinely dont know how you will be able to do that. but deepseek is a company. Also, you may/can do that in deepseek chat if your book is small enough

He can try to make the file smaller by using ilovepdf. Should make it small enough to parse it.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment