# DocumentIQA: Document Insight Question/Answer
## Introduction
Question answering on scientific documents. Upload your document and ask questions.
Our implementation uses [Grobid](https://github.com/kermitt2/grobid) for text extraction instead of a raw PDF-to-text converter.
Thanks to Grobid, we can precisely extract the abstract and the full text.
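As an illustration of what structured extraction makes possible, here is a minimal sketch that separates the abstract from the body paragraphs of a TEI XML document, the format Grobid produces. It is not this project's actual code: the function name `extract_abstract_and_body` and the `SAMPLE_TEI` string are hypothetical, and the sketch assumes the TEI has already been obtained from a Grobid server (for example through its `processFulltextDocument` REST service).

```python
import xml.etree.ElementTree as ET

# Namespace used by TEI documents such as those returned by Grobid.
TEI_NS = {"tei": "http://www.tei-c.org/ns/1.0"}

# Hypothetical, heavily shortened TEI document used only for illustration.
SAMPLE_TEI = """<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <teiHeader>
    <profileDesc>
      <abstract><div><p>We study  superconductors.</p></div></abstract>
    </profileDesc>
  </teiHeader>
  <text>
    <body>
      <div>
        <head>Introduction</head>
        <p>First paragraph.</p>
        <p>Second <hi>paragraph</hi>.</p>
      </div>
    </body>
  </text>
</TEI>"""


def _text(element):
    """Return the element's full text (including children) with whitespace collapsed."""
    return " ".join("".join(element.itertext()).split())


def extract_abstract_and_body(tei_xml):
    """Split a TEI document into an abstract string and a list of body paragraphs."""
    root = ET.fromstring(tei_xml)
    abstract_paragraphs = root.findall(
        ".//tei:profileDesc/tei:abstract//tei:p", TEI_NS
    )
    body_paragraphs = root.findall(".//tei:text/tei:body//tei:p", TEI_NS)
    abstract = " ".join(_text(p) for p in abstract_paragraphs)
    body = [_text(p) for p in body_paragraphs]
    return abstract, body


if __name__ == "__main__":
    abstract, body = extract_abstract_and_body(SAMPLE_TEI)
    print("Abstract:", abstract)
    for paragraph in body:
        print("Paragraph:", paragraph)
```

Keeping the abstract and the body paragraphs separate, rather than flattening the PDF into a single string, is what lets a question-answering pipeline index each passage on its own.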
This is just the beginning, and publishing it early might help gather more feedback.
**NOTE**: This project focuses on scientific articles. Uploading books or other large documents might not work as expected.
**Work in progress**
https://document-insights.streamlit.app/
**An OpenAI or HuggingFace API key is required**
### Screencast
This is a screencast of an older version:
https://github.com/lfoppiano/document-qa/assets/15426/b3882119-5a87-40f5-a2de-ad47447eb40c
### Acknowledgement
This project is developed at the [National Institute for Materials Science](https://www.nims.go.jp) (NIMS) in Japan.