Skip to content

A prototype for visualizing and exploring vector document indexes

Notifications You must be signed in to change notification settings

brannt/vexplorer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Document Clustering Dash App

This Dash application performs K-means clustering on a vector index of embedded document chunks and visualizes the results using a Plotly 3D scatter plot. Users can interact with the visualization by clicking on clusters to see the top-3 chunks closest to the cluster's centroid.

Features

  • K-means clustering on document embeddings.
  • 3D scatter plot visualization of clusters.
  • Interactive cluster inspection in the UI.
  • Support for reading FAISS index files.

Installation

To set up the project environment, you need to install the required dependencies:

pip install -r requirements.txt

Usage

To run the app, use the following command:

python clustering_app.py <index_dir> <num_clusters>

Replace <index_dir> with the path to the directory containing the FAISS index and document storage, and <num_clusters> with the desired number of clusters.

Enjoy exploring your document clusters!

About

A prototype for visualizing and exploring vector document indexes

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages