flask gradio torch transformers langchain chromadb accelerate bitsandbytes