#

sentence-embeddings

Here are 258 public repositories matching this topic...

FlagOpen / FlagEmbedding

Retrieval and Retrieval-augmented LLMs

information-retrieval embeddings sentence-embeddings text-semantic-similarity llm retrieval-augmented-generation

Updated Jun 6, 2024
Python

BERTopic

MaartenGr / BERTopic

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

nlp machine-learning topic transformers topic-modeling bert topic-models sentence-embeddings topic-modelling ldavis

Updated Jun 6, 2024
Python

derak-isaack / Ticket-Classification

Model to classify and categorize user complaints into categories for specific departments using LLMs.

openai gpt sentence-embeddings colab-notebook streamlit sentence-transformers

Updated Jun 5, 2024
Jupyter Notebook

SeanLee97 / AnglE

Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard

Updated Jun 5, 2024
Python

smartIU / arxiv-topics

Adapted BERTopic pipeline for Topic Modeling the arXiv dataset

dash topic-modeling arxiv sentence-embeddings trend-analysis umap-hdbscan bertopic llama3

Updated Jun 4, 2024
Python

txtai

neuml / txtai

💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows

Updated Jun 4, 2024
Python

nikolamilosevic86 / local-genAI-search

Local-GenAI-Search is a generative search engine based on Llama 3, langchain and qdrant that answers questions based on your local files

search-engine local python3 sentence-embeddings msmarco sentence-transformers large-language-models generative-ai langchain qdrant-client llama3

Updated Jun 4, 2024
Python

SkywardAI / chat-backend

Backend for the AI-copilot

api ai container conversational-ai sentence-embeddings rag fastapi vector-database sentence-transformers llm-training llm-inference

Updated Jun 4, 2024
Python

Galal-pic / Talented-recruitment-and-skills-analysis-system

The project's goal is to help job seekers understand the basic qualifications for specific jobs and evaluate the suitability of their skills for those positions. Additionally, the program aims to assist recruiters in enhancing their resume selection processes by analyzing and understanding job advertisements ....

python nlp flask sqlalchemy scraping spacy transformer ner sentence-embeddings fine-tuning huggingface sentence-transformers cvanalysis

Updated Jun 3, 2024
HTML

EQTPartners / pause

🍊 PAUSE (Positive and Annealed Unlabeled Sentence Embedding), accepted by EMNLP'2021 🌴

nlp classification-algorithm similarity-search document-embedding sentence-embeddings positive-unlabeled-learning motherbrain

Updated May 30, 2024
Python

RuochenT / transformer_hybrid

This study aims to investigate the effectiveness of three Transformers (BERT, RoBERTa, XLNet) in handling data sparsity and cold start problems in the recommender system. We present a Transformer-based hybrid recommender system that predicts missing ratings and ex- tracts semantic embeddings from user reviews to mitigate the issues.

matrix-factorization transformer bert multilabel-classification sentence-embeddings hybrid-recommender-system roberta transformer-architecture xlnet cold-start-problem

Updated May 30, 2024
Jupyter Notebook

toninf / dense_retrieval

Word2vec, sentenceBert, BM25 and IVFFlat Index quality and speed comparison

word2vec sentence-embeddings faiss pyterrier

Updated May 28, 2024
Jupyter Notebook

harmonydata / pdf-questionnaire-extraction

Data and scripts for training the open source PDF questionnaire extraction component for Harmony Kaggle competition using natural language processing (NLP)

nlp competition open-source pdf data-science natural-language-processing information-retrieval text-mining text-classification kaggle information-extraction psychology research-project pdf-files psychology-experiments sentence-embeddings pdf-document-processor psychology-questionnaire

Updated May 27, 2024
Python

goamegah / torchSTC

PyTorch implementation of Self-training approch for short text clustering

machine-learning deep-learning clustering pytorch self-training autoencoder stc representation-learning short-text sentence-embeddings deep-clustering

Updated May 27, 2024
Python

louisbrulenaudet / tax-retrieval-benchmark

An implementation of the TaxRetrievalBenchmark task for the 🤗 Massive Text Embedding Benchmark (MTEB) framework.

benchmark information-retrieval retrieval tax embeddings taxation semantic-search fiscal sentence-embeddings stp rag droit sentence-transformers sbert fiscalite retrieval-augmented-generation mteb

Updated May 26, 2024
Jupyter Notebook

LazarusNLP / indonesian-sentence-embeddings

Embedding Representation for Indonesian Sentences!

natural-language-processing indonesia unsupervised-learning indonesian semantic-textual-similarity sentence-embeddings sentence-transformers sbert

Updated May 22, 2024
Jupyter Notebook

dayyass / muse_tf2pt

Convert MUSE from TensorFlow to PyTorch and ONNX

multilingual machine-learning natural-language-processing encoder embeddings transformer embedder sentence-embeddings universal-sentence-encoder sentence-transformers

Updated May 22, 2024
Jupyter Notebook

JohnSnowLabs / nlu

1 line for thousands of State of The Art NLP models in hundreds of languages The fastest and most accurate way to solve text problems.

Updated May 21, 2024
Python

Moradnejad / ColBERT-Using-BERT-Sentence-Embedding-for-Humor-Detection

ColBERT humor dataset for the task of humor detection, containing 200,000 jokes/news

paper model humor dataset bert humor-detection sentence-embeddings bert-embeddings colbert

Updated May 19, 2024
Jupyter Notebook

chungimungi / DocQA

A custom cross encoder used to predict the diseases from an input of symptoms

natural-language-processing healthcare sentence-embeddings disease-prediction cross-encoders

Updated May 7, 2024
Python

Improve this page

Add a description, image, and links to the sentence-embeddings topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the sentence-embeddings topic, visit your repo's landing page and select "manage topics."