Clement Vachet committed on
Commit
2ff7a91
·
1 Parent(s): 5eb3904

Add general project description

Files changed (1)
  1. README.md +34 -1
README.md CHANGED
@@ -9,4 +9,37 @@ app_file: app.py
  pinned: true
  ---
 
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
+
+ [![](https://img.shields.io/badge/python-3.10+-blue.svg)](https://www.python.org/downloads/)
+ [![code style](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black)
+ [![linting: pylint](https://img.shields.io/badge/linting-pylint-yellowgreen)](https://github.com/pylint-dev/pylint)
+
+
+ **Aim: PDF-based AI chatbot with retrieval-augmented generation**
+
+
+ **Architecture / Tech stack:**
+ - Front-end:
+   - user interface via the Gradio library
+ - Back-end:
+   - Hugging Face embeddings
+   - Hugging Face API for open-source LLMs
+   - ChromaDB vector database
+   - LangChain conversational retrieval chain
+
+
+ **Description:**
+ - This AI assistant uses LangChain and open-source LLMs to perform retrieval-augmented generation (RAG) on your PDF documents. The user interface explicitly shows multiple steps to help you understand the RAG workflow. The chatbot takes past questions into account when generating answers (via conversational memory) and includes document references for clarity. It relies on small LLMs that can run directly on CPU hardware.
+
+
+ **Available open-source LLMs:**
+ - Meta Llama series
+ - Alibaba Qwen2.5 series
+ - Mistral AI models
+ - Microsoft Phi-3.5 series
+ - Google Gemma models
+ - Hugging Face Zephyr and SmolLM series
+
+
+ You can try out the deployed [Hugging Face Space](https://huggingface.co/spaces/cvachet/pdf-chatbot)!
+
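
The retrieval and conversational-memory steps named in the README description can be sketched roughly as follows. This is a minimal, dependency-free illustration and not the Space's actual code: it does not use Gradio, LangChain, ChromaDB, or any Hugging Face model. The bag-of-words `embed` function stands in for a real embedding model, and every name here (`embed`, `cosine`, `retrieve`, `build_prompt`, the sample `chunks`) is a hypothetical chosen for the example.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy bag-of-words 'embedding' (assumed stand-in for a real embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(count * b[word] for word, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(question, chunks, k=2):
    """Return the k chunks most similar to the question (the 'retrieval' step)."""
    q = embed(question)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c["text"])), reverse=True)
    return ranked[:k]


def build_prompt(question, history, retrieved):
    """Combine past turns, retrieved context with source references, and the question."""
    past = "\n".join(f"User: {q}\nAssistant: {a}" for q, a in history)
    context = "\n".join(
        f"[{c['source']}, p.{c['page']}] {c['text']}" for c in retrieved
    )
    return (
        f"Chat history:\n{past}\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}\nAnswer:"
    )


# Hypothetical PDF chunks, as they might look after splitting a document by page.
chunks = [
    {"source": "report.pdf", "page": 1, "text": "The model was trained on 10,000 labeled images."},
    {"source": "report.pdf", "page": 2, "text": "Evaluation used a held-out test set of 500 images."},
    {"source": "report.pdf", "page": 3, "text": "Future work includes video segmentation."},
]
history = [("What is the report about?", "It describes an image model.")]
question = "How many images were used for training the model?"

top = retrieve(question, chunks, k=2)
prompt = build_prompt(question, history, top)
print(prompt)
```

In a real pipeline the resulting prompt would be sent to an LLM, and the `source` and `page` fields of the retrieved chunks would be shown next to the answer as document references.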