There are 12 repositories under the text-embedding topic.
MTEB: Massive Text Embedding Benchmark
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
SGPT: GPT Sentence Embeddings for Semantic Search
Generative Representational Instruction Tuning
Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard
Code & data accompanying the KDD 2017 paper "KATE: K-Competitive Autoencoder for Text"
Efficient LLM inference on Slurm clusters using vLLM.
Go module for fetching embeddings from embeddings providers
A vector embedding database with multiple storage engines and AI embedding integrations
Simple script to compute CLIP-based scores given a DALL-E trained model.
Simple customizable evaluation for text retrieval performance of Sentence Transformers embedders on PDFs
Official codebase for the ACL 2025 Findings paper: Optimized Text Embedding Models and Benchmarks for Amharic Passage Retrieval.
Perform topic classification on news articles in several limited-labeled data regimes.
HSTU-BLaIR: Lightweight Contrastive Text Embedding for Generative Recommender 🌱
Simple script to re-rank images using OpenAI's CLIP https://github.com/openai/CLIP.
Topic Embedding, Text Generation and Modeling using diffusion
🧠 ML-Article-Classifier is a modular Python project for classifying articles using advanced NLP techniques. It features sentence embeddings, clustering, and classification utilities, with Jupyter notebook demos, extensible helper functions, and best practices for research and production use.
Flask API for generating text embeddings using OpenAI or sentence_transformers
SERVER: Multi-modal Speech Emotion Recognition using Transformer-based and Vision-based Embeddings
A Node-RED node that interacts with OpenAI machine learning models to generate text like ChatGPT
Contextual embedding for text blobs.
Using Semantic Kernel to obtain answers from a PDF document, with embeddings created by HuggingFace and stored in Redis.
I have improved the demo by using Azure OpenAI’s Embedding model (text-embedding-ada-002), which has a powerful word embedding capability. This model can also vectorize product key phrases and recommend products based on cosine similarity, but with better results. You can find the updated repo here.
PL-MTEB: Polish Massive Text Embedding Benchmark
Mind-X is my intelligent alter ego that understands me the best. It assists with and resolves my bothersome tasks, growing in real-time as a next-generation PersonAI system.
An image retrieval engine.
🌄 Search images through text by writing a caption or a description. You will be intelligently assisted while typing.
Chatbot with LLM + RAG
Image Steganography GUI | Easily Hide Text Files within Images with User-Friendly GUI | Python Tool
Embedding text into a vector using pre-trained BERT word embeddings and pooling layers, for the purpose of measuring text similarity
Babel Street Analytics Client Library for C#
A semantic search system built with PostgreSQL and pgvector, powered by Gemini for generating text embeddings.