There are 81 repositories under embeddings topic.
The Postgres development platform. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.
Open-source search and retrieval database for AI applications.
100+ Chinese Word Vectors 上百种预训练中文词向量
Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/
💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows
Retrieval and Retrieval-augmented LLMs
LangChain4j is an open-source Java library that simplifies the integration of LLMs into Java applications through a unified API, providing access to popular LLMs and vector databases. It makes implementing RAG, tool calling (including support for MCP), and agents easy. LangChain4j integrates seamlessly with various enterprise Java frameworks.
SeaTunnel is a multimodal, high-performance, distributed, massive data integration tool.
LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.
Postgres with GPUs for ML/AI apps.
The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..
text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
Chat with your notes & see links to related content with AI embeddings. Use local models or 100+ via APIs like Claude, Gemini, ChatGPT & Llama 3
A blazing fast inference solution for text embeddings models
A python library for self-supervised learning on images.
A library for transfer learning by reusing parts of TensorFlow models.
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
A curated list of Generative AI tools, works, models, and references
One beautiful Ruby API for OpenAI, Anthropic, Gemini, Bedrock, OpenRouter, DeepSeek, Ollama, VertexAI, Perplexity, Mistral, GPUStack & OpenAI compatible APIs. Chat, Vision, Audio, PDF, Images, Embeddings, Tools, Streaming & Rails integration.
Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).
📋 Survey papers summarizing advances in deep learning, NLP, CV, graphs, reinforcement learning, recommendations, graphs, etc.
An app to interact privately with your documents using the power of GPT, 100% privately, no data leaks
Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.
The collection of pre-trained, state-of-the-art AI models for ailia SDK
Basic Utilities for PyTorch Natural Language Processing (NLP)
Documentation for Google's Gen AI site - including the Gemini API and Gemma
The universal tool suite for vector database management. Manage Pinecone, Chroma, Qdrant, Weaviate and more vector databases with ease.
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
The Virtual Feature Store. Turn your existing data infrastructure into a feature store.
Predict stock market prices using RNN model with multilayer LSTM cells + optional multi-stock embeddings.
A robust, all-in-one GPT interface for Discord. ChatGPT-style conversations, image generation, AI-moderation, custom indexes/knowledgebase, youtube summarizer, and more!