openai tiktoken opencc docx2txt PyPDF2 plotly scipy