opencv-python
pymupdf
numpy
pytesseract
pillow
langchain-community
langchain-openai
langchain-core
scikit-learn
gradio
tqdm