There are 26 repositories under sentence-embeddings topic.
💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows
Retrieval and Retrieval-augmented LLMs
text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
A curated list of pretrained sentence and word embedding models
1 line for thousands of State of The Art NLP models in hundreds of languages The fastest and most accurate way to solve text problems.
unified embedding model
SGPT: GPT Sentence Embeddings for Semantic Search
Natural Language Toolkit for Indic Languages aims to provide out of the box support for various NLP tasks that an application developer might need
Compute Sentence Embeddings Fast!
BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences
Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard
A Structured Self-attentive Sentence Embedding
Keyphrase or Keyword Extraction 基于预训练模型的中文关键词抽取方法(论文SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model 的中文版代码)
The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to open an issue if you run into any trouble!
A curated list of awesome resources related to Semantic Search🔎 and Semantic Similarity tasks.
A curated list of research papers in Sentence Reprsentation Learning and a sts leaderboard of sentence embeddings.
Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings"
Papers and Book to look at when starting AGI 📚
Extract knowledge from all information sources using gpt and other language models. Index and make Q&A session with information sources.
[NeurIPS 2019] Spherical Text Embedding
Clustering sentence embeddings to extract message intent
A Dead Simple BERT API for Python and Java (https://github.com/google-research/bert)
How to encode sentences in a high-dimensional vector space, a.k.a., sentence embedding.
[EMNLP 2022] Improved Universal Sentence Embeddings with Prompt-based Contrastive Learning and Energy-based Learning
Rust port of sentence-transformers (https://github.com/UKPLab/sentence-transformers)
Exploring the simple sentence similarity measurements using word embeddings
Local-GenAI-Search is a generative search engine based on Llama 3, langchain and qdrant that answers questions based on your local files
Simple Tensorflow Implementation of "A Structured Self-attentive Sentence Embedding" (ICLR 2017)
Finetune mistral-7b-instruct for sentence embeddings
Code for KaLM-Embedding models