RAG-from-scratch

This repo containes codebase for a Retrieval-Augmented Generation (RAG) based chatbot at https://chat.jayeshdev.com that I built for my technical blog.

Asking the chatbot questions about my blog

This repo also has a companion blog post Chat with my blog: A RAG based chatbot that talks about me and my blog !

Codebase and Tech

The application is built using the following technologies:

Backend:
- Langchain for Retrieval-Augmented Generation (RAG) logic
- FastAPI & LangServe for serving API endpoints
- Langfuse for monitoring and tracking
- Chroma as the Vector Database
Frontend:
- NextJS & Chakra UI for the UI
- LangchainJS for interacting with backend APIs
Deployment:
- Docker for containerization and multi stage builds
- Docker Compose for orchestrating multi-container applications

The codebase is built on top of the excellent chat-langchain repo by langchain, and carries MIT License. I made the following modifications to the original code:

Backend:
- Refactor to use self hosted Chroma Vector Database (with security) instead of Weaviate Cloud.
- use Together AI for embedding (msmarco-bert-base-dot-v5) and answer generation (Mixtral-Instruct-v0.1).
- Add support for parsing using Unstructured IO during ingestion.
- An improved chain that generates better standalone questions and incorporates summary of chat history.
- Refactoring to improve modularity and maintainability.
- Improved prompts with step-by-step instructions and few-shot examples.
- Add support for using Open Source Langfuse instead of Langsmith for monitoring.
Frontend:
- Removed Langsmith integration
- Modified the example prompts and page contents
- Added footer element for links to my social
Deployment:
- Added Dockerfiles with multi stage building for backend and frontend to keep deployment lightweight.

Usage

1. Clone the repository

To get started, clone this repository to your local machine using the following command:

git clone https://github.com/jayeshmahapatra/rag-chatbot

2. Modify the configs and env files

Backend
- Modify dev.config or prod.config at rag_chatbot_backend/chatbot_backend/configs depending on your deployment target.
- Create a chroma/chroma.env file with the same format and info as chroma/chroma.env.example.
- Create a rag_chatbot_backend/keys.env file with the same format and info as rag_chatbot_backend/keys.env.example.
Frontend env file
- create a rag_chatbot_frontend/.env.local file with the same format and info as rag_chatbot_frontend/.env.example

3. Build the images and deploy the containers using docker compose

Use docker compose to build and deploy in detached mode.

docker compose -f docker-compose.dev.yml up --build -d

For production environment

docker compose -f docker-compose.prod.yml up --build -d

4. (Optional) Poplulate the Vector Database

If the mounted folders have no data in them, the Chroma Vector Database will be empty. You can populate it by running an interactive session with the backend container and running the ingestion_pipeline.py.

Find the name or ID of backend container using docker

docker ps

Launch an interactive session

docker exec -it <backend_container_id_or_name> /bin/bash

Execute the ingestion pipeline

python ingestion_pipeline.py

Repo Structure

The repository structure is organized as follows:

Root:
- Contains the Docker Compose files for both development docker-compose.dev.yml and production docker-compose.prod.yml.
Chroma:
- Contains environment files chroma.env for the Chroma Vector Database used in the project.
rag_chatbot_backend:
- Contains the backend codebase for the RAG chatbot.
- chatbot_backend:
  - Contains the core components of the chatbot backend:
    - chain: Implements the retrieval-augmented generation logic using langchain.
    - configs: Contains configuration files dev.config, prod.config for different deployment environments.
    - ingestion_pipeline.py: Script for populating the Chroma Vector Database.
    - main.py: Main FastAPI entry point for the backend langserve server.
    - utils: Contains utility functions that are used during ingestion.
rag_chatbot_frontend:
- Contains the frontend codebase for the RAG chatbot.
- app:
  - components: Contains reusable UI components for the chatbot interface.
  - globals.css: Global styles for the frontend.
  - layout.tsx and page.tsx: Layout and page components.
  - utils: Contains constants (constants.tsx).
- Other configuration and build files:
  - .env.example and .env.local: Environment variable files.

Name		Name	Last commit message	Last commit date
Latest commit History 52 Commits
Data		Data
chroma		chroma
media		media
rag_chatbot_backend		rag_chatbot_backend
rag_chatbot_frontend		rag_chatbot_frontend
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
docker-compose.dev.yml		docker-compose.dev.yml
docker-compose.prod.yml		docker-compose.prod.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Data

Data

chroma

chroma

media

media

rag_chatbot_backend

rag_chatbot_backend

rag_chatbot_frontend

rag_chatbot_frontend

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

docker-compose.dev.yml

docker-compose.dev.yml

docker-compose.prod.yml

docker-compose.prod.yml

Repository files navigation

RAG-from-scratch

Codebase and Tech

Usage

1. Clone the repository

2. Modify the configs and env files

3. Build the images and deploy the containers using docker compose

4. (Optional) Poplulate the Vector Database

Repo Structure

About

Releases

Packages

Contributors 2

Languages

License

jayeshmahapatra/rag-chatbot

Folders and files

Latest commit

History

Repository files navigation

RAG-from-scratch

Codebase and Tech

Usage

1. Clone the repository

2. Modify the configs and env files

3. Build the images and deploy the containers using docker compose

4. (Optional) Poplulate the Vector Database

Repo Structure

About

Topics

Resources

License

Stars

Watchers

Forks

Languages