Langchain AI Assistant with Hybrid RAG and Memory

This application can be configured (see config.py) to create your own specialized AI assistant.

AI Python framework: Langchain
Web interface Python framework: Streamlit
Vector DB: Chroma
Hybrid RAG: bm25 keyword search and vector db semantic search (BM25Retriever + vector_db.as_retriever = EnsembleRetriever). Hybrid RAG improves greatly the efficiency of the RAG search.
Chat history with limited size (use of predefined chains: history_aware_retriever, stuff_documents_chain, retrieval_chain)
Streaming of the AI answer
Logs sent to Langsmith
AI Models: OpenAI GPT 4o, Google Gemini 1.5, Anthropic Claude 3, Ollama (Llama 3, etc.). Vector size: 3072.
Admin interface (scrape web pages, upload PDF files, embed in vector DB, change model, etc.)
Files ingestion into the vector DB: JSON files (one JSON item / web page per chunk) and PDF files (one PDF page per chunk)

Frameworks and tools:

Langchain: https://www.langchain.com (Python framework for AI applications)
Langsmith: https://smith.langchain.com (logs and debug for Langchain applications)
Streamlit: https://streamlit.io (Python framework for web interfaces for data / AI applications)
Chroma: https://www.trychroma.com (Vector DB)
OpenAI (GPT): https://platform.openai.com (LLM)
Anthropic (Claude): https://console.anthropic.com/dashboard (LLM)
Google VertexAI (Gemini): https://cloud.google.com/vertex-ai (LLM)
Ollama (Llama, etc.): https://ollama.com (LLM)

Installation:

Requirements: Python 3.10+

Procedure to install the application on a Linux server:

$ git clone https://github.com/dodeeric/langchain-ai-assistant-with-hybrid-rag.git
$ cd langchain-ai-assistant-with-hybrid-rag

Add your API keys (only the OpenAI API key is mandatory) and admin password:

$ nano .env

OPENAI_API_KEY = "sk-proj-xxx"       ==> Go to https://platform.openai.com/api-keys
ANTHROPIC_API_KEY = "sk-ant-xxx"     ==> Go to https://console.anthropic.com/settings/keys
LANGCHAIN_API_KEY = "ls__xxx"        ==> Go to https://smith.langchain.com (Langsmith)
LANGCHAIN_TRACING_V2 = "true"        ==> Set to false if you will not use Langsmith traces
ADMIN_PASSWORD = "xxx"               ==> You chose your password
GOOGLE_APPLICATION_CREDENTIALS = "./serviceaccountxxx.json"  ==> Path to the Service Account (with VertexAI role) JSON file

Configure the application:

$ nano config/config.py

Install required libraries:

$ pip install -U -r requirements.txt

Launch the AI Assistant:

$ bash app.sh start

Remark: if you get the "streamlit: command not found" error, then log off, then log in, to have the PATH updated.

Go to: http://IP:8080 (the IP is displayed on the screen in the "External URL".)

Go first to the admin interface (introduce the admin password), and scrape some web pages and/or upload some PDF files, then embed them to the vector DB.

Install a reverse proxy (Nginx for example) on the server if you want to listen on port 80 (http) or 443 (https). It has to forward the requests from port 80 or 443 to port 8080.

Procedure to install and configure Nginx as reverse proxy:

For example:

Internal/private IP and port: 10.0.0.4:8080
External/public IP and port: 172.205.226.216:80 (domain name: bmae.edocloud.be)

$ sudo apt update
$ sudo apt install nginx
$ sudo nano /etc/nginx/sites-available/streamlitnginxconf

server {
  listen 80;
  server_name bmae.edocloud.be www.bmae.edocloud.be;
  location / {
    proxy_pass http://10.0.0.4:8080;
    proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
    proxy_set_header Host $http_host;
    proxy_redirect off;
    proxy_http_version 1.1;
    proxy_set_header Upgrade $http_upgrade;
    proxy_set_header Connection "upgrade";
  }
}

$ sudo ln -s /etc/nginx/sites-available/streamlitnginxconf /etc/nginx/sites-enabled/streamlitnginxconf
$ sudo systemctl restart nginx

Go to: http://172.205.226.216 or http://bmae.edocloud.be

More info: https://ngbala6.medium.com/deploy-streamlit-app-on-nginx-cfa327106050

To configure a TLS certificate for https (port 443):

$ sudo snap install --classic certbot
$ sudo ln -s /snap/bin/certbot /usr/bin/certbot
$ sudo certbot --nginx -d bmae.edocloud.be -d www.bmae.edocloud.be
$ sudo certbot renew --dry-run

Go to: https://172.205.226.216 or https://bmae.edocloud.be

More info: https://www.digitalocean.com/community/tutorials/how-to-secure-nginx-with-let-s-encrypt-on-ubuntu-18-04

Check the Chroma vector DB: (OPTIONAL)

$ cd chromadb
$ sqlite3 chroma.sqlite3

sqlite> .tables                            ===> List of the tables
sqlite> select * from collections;         ===> Name of the collection (bmae) & size of the vectors (3072)
sqlite> select count(*) from embeddings;   ===> Number of records in the DB
sqlite> select id, key, string_value from embedding_metadata LIMIT 10 OFFSET 0;       ===> Display JSON items and PDF pages
sqlite> PRAGMA table_info(embedding_metadata);                                        ===> Structure of the table   
sqlite> select * from embedding_metadata where string_value like '%Delper%';          ===> Display matching records
sqlite> select count(*) from embedding_metadata where string_value like '%Delper%';   ===> Display number of matching records

Running Ollama / Llama 3 (or another LLM) locally:

Install Ollama:

$ curl -fsSL https://ollama.com/install.sh | sh

The Ollama API is now available at 127.0.0.1:11434.

Run the Llama 3 LLM:

$ ollama pull llama3
$ ollama list
$ ollama serve

Query the LLM in a new window:

$ ollama run llama3
>>> What's the capital of France?
>>> /bye

Run Chroma DB as a server:

a) Docker

$ docker pull chromadb/chroma
$ docker run -p 8081:8081 chromadb/chroma

b) Python

$ chroma run --path /db_path

Name		Name	Last commit message	Last commit date
Latest commit History 998 Commits
config		config
images		images
json_files		json_files
modules		modules
old_versions		old_versions
pages		pages
pdf_files		pdf_files
.gitignore		.gitignore
Assistant.py		Assistant.py
LICENSE		LICENSE
README.md		README.md
app.sh		app.sh
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

config

config

images

images

json_files

json_files

modules

modules

old_versions

old_versions

pages

pages

pdf_files

pdf_files

.gitignore

.gitignore

Assistant.py

Assistant.py

LICENSE

LICENSE

README.md

README.md

app.sh

app.sh

requirements.txt

requirements.txt

Repository files navigation

Langchain AI Assistant with Hybrid RAG and Memory

About

Releases 4

Languages

License

dodeeric/langchain-ai-assistant-with-hybrid-rag

Folders and files

Latest commit

History

Repository files navigation

Langchain AI Assistant with Hybrid RAG and Memory

About

Topics

Resources

License

Stars

Watchers

Forks

Languages