amtam0 commited on
Commit
ab0437a
1 Parent(s): eaffd42

add requirements+README

Browse files
Files changed (2) hide show
  1. README.md +34 -0
  2. requirements.txt +9 -0
README.md ADDED
@@ -0,0 +1,34 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ### Gradio app to run QA on a given website
2
+
3
+ What does the app do :
4
+ - Get input prompt (text or transcribed audio)
5
+ - Similarity search in a vector DB, returns top k chunks
6
+ - Run llm based on context (using basic prompt - langchain not used yet)
7
+ - Return response and metadata (url, scores, text chunks...)
8
+
9
+ Setup process:
10
+ - vector store (FAISS) a given website (sitemap) using langchain
11
+ - download a llm
12
+ - containarise stt (faster-whisper) due to os incompatibility
13
+ - build Gradio app
14
+
15
+ To build Faster-whisper docker image name "fasterwhisper"
16
+
17
+ `docker build -t . fasterwhisper`
18
+
19
+ Create python env 'llama'
20
+ ```
21
+ conda create --name="llama" python=3.10
22
+ conda activate llama
23
+ pip install -r requirements.txt
24
+ ```
25
+
26
+ To run the app:
27
+
28
+ `bash run.sh`
29
+
30
+ To push to hugging face (including large files) use this command:
31
+
32
+ `git lfs migrate import --everything`
33
+
34
+ `git add .; git commit -m "message"; git push -f origin`
requirements.txt ADDED
@@ -0,0 +1,9 @@
 
 
 
 
 
 
 
 
 
 
1
+ faster_whisper==0.6.0
2
+ Flask==2.3.2
3
+ gradio==3.35.2
4
+ langchain==0.0.225
5
+ llama_cpp_python==0.2.7
6
+ Requests==2.31.0
7
+ torch==2.0.1
8
+ transformers==4.30.2
9
+ faiss-cpu==1.7.4