openai gradio llama-index pypdf sentence_transformers trafilatura