mh-explo / README.md
amtam0's picture
add requirements+README
ab0437a

Gradio app to run QA on a given website

What does the app do :

  • Get input prompt (text or transcribed audio)
  • Similarity search in a vector DB, returns top k chunks
  • Run llm based on context (using basic prompt - langchain not used yet)
  • Return response and metadata (url, scores, text chunks...)

Setup process:

  • vector store (FAISS) a given website (sitemap) using langchain
  • download a llm
  • containarise stt (faster-whisper) due to os incompatibility
  • build Gradio app

To build Faster-whisper docker image name "fasterwhisper"

docker build -t . fasterwhisper

Create python env 'llama'

conda create --name="llama" python=3.10
conda activate llama
pip install -r requirements.txt

To run the app:

bash run.sh

To push to hugging face (including large files) use this command:

git lfs migrate import --everything

git add .; git commit -m "message"; git push -f origin