fastapi uvicorn python-multipart torch datasets pdf2image pytesseract transformers haystack-ai qdrant-haystack fastembed-haystack scikit-learn