MTEB: Massive Text Embedding Benchmark
Updated Jun 6, 2024 - Python
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
A realtime indexing and structured extraction engine for unstructured data, used to build generative AI applications
Build and deploy a fully-featured, observable user-facing RAG backend in minutes.
Generative Representational Instruction Tuning
Retrieval Augmented Generative Engine
Fast, accurate, lightweight Python library for generating state-of-the-art embeddings
All-in-One: Text Embedding, Retrieval, Rerank and RAG
Palladian is a Java-based toolkit with functionality for text processing, classification, information extraction, and data retrieval from the Web.
Atmospheric data Community Toolkit - A Python-based toolkit for exploring and analyzing time series atmospheric datasets
Customizable Case-Based Reasoning (CBR) toolkit for Python with a built-in API and CLI.
Cottontail DB is a column store vector database aimed at multimedia retrieval. It supports both classical Boolean retrieval and vector-space retrieval (nearest-neighbour search, as used in similarity search) through a unified data and query model.
Neural Search
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
Advanced concepts of data, storage, organization, and retrieval. Topics include multiple-linked lists, balanced trees, graphs, abstract data types, classes and methods, object-oriented programming, searching and sorting.
Epsilla is a high-performance Vector Database Management System. Try out hosted Epsilla at https://cloud.epsilla.com/
Voyage AI Official Python Library