torch transformers pdf2image pytesseract