# Folder structure
```
ORAL_PDF_QA/
├── __pycache__/
├── bge_model_ctranslate2/
├── data/
├── parsed/
├── logs/
├── pdf/
├── pictures/
├── tables/
├── venv/
├── .gitignore
├── chroma_service.py
├── config.py
├── gradio_demo.py
├── pdf_parsing_service.py
├── questions.txt
├── README.MD
├── requirements.txt
└── utils.py
```
# Download
```
pip install -r requirements.txt
```
Download `bge_model_ctranslate2` embedding model
Download `parsed` folder at https://drive.google.com/drive/folders/174I-pX1f7_mGG28Wwd9JPOgnOS5O16BA?usp=sharing
Download `tables` folder (extracted tables) from https://drive.google.com/drive/folders/12r0F_Ce25kecUSzp_HvjHjhrV6LbyYyx?usp=sharing
Download `pictures` folder (extracted pictures) from https://drive.google.com/drive/folders/1EvTLNNrBvQr-_lIzZSRL8ayrevKTmtJK?usp=sharing
# Usage
```
python chroma_service.py
```
```
pyrhon gradio_demo.py
```