There are 27 repositories under the embedding topic.
Langchain-Chatchat (formerly Langchain-ChatGLM): a local-knowledge-based RAG and Agent app built with Langchain and LLMs such as ChatGLM, Qwen, and Llama
Easy-to-use and powerful LLM and SLM library with awesome model zoo.
100+ pre-trained Chinese word vectors
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Phi4, ...) (AAAI 2025).
A community-driven way to read and chat with AI bots, powered by ChatGPT.
Code search MCP for Claude Code. Make entire codebase the context for any coding agent.
The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.
Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and search embeddings and metadata.
Siamese and triplet networks with online pair/triplet mining in PyTorch
🔍 Search your Telegram messages wisely
Data framework for your LLM applications, focused on server-side solutions.
All-in-one platform for search, recommendations, RAG, and analytics offered via API
A curated list of community detection research papers with implementations.
UltraRAG 2.0: Less Code, Lower Barrier, Faster Deployment! MCP-based low-code RAG framework that lets researchers build complex pipelines and focus on creative innovation.
Run AI models locally on your machine with Node.js bindings for llama.cpp. Enforce a JSON schema on the model output at the generation level.
Extensible, parallel implementations of t-SNE
🔍 LLM Application Development in Practice, Part 1: A Full-Stack Guide to RAG. Read online: https://datawhalechina.github.io/all-in-rag/
The TypeScript library for building AI applications.
WFGY 2.0. Semantic Reasoning Engine for LLMs (MIT). Fixes RAG/OCR drift, collapse & “ghost matches” via symbolic overlays + logic patches. Autoboot; OneLine & Flagship. ⭐ Star if you explore semantic RAG or hallucination mitigation.
A @ClickHouse fork that supports high-performance vector search and full-text search.
ChatWeb can crawl web pages, read PDF, DOCX, TXT, and extract the main content, then answer your questions based on the content, or summarize the key points.
Four word embedding models implemented in Python, supporting arbitrary context features.
Generative Representational Instruction Tuning
Embedding, NMT, Text_Classification, Text_Generation, NER etc.
Angular penalty loss functions in PyTorch (ArcFace, SphereFace, Additive Margin, CosFace)
✨ AI interface for tinkerers (Ollama, Haystack RAG, Python)
This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR 2025]
Quickly and easily build an AI website or application using embeddings!
An easy to use Neural Search Engine. Index latent vectors along with JSON metadata and do efficient k-NN search.
ERNIE Bot Agent is a Large Language Model (LLM) Agent Framework, powered by the advanced capabilities of ERNIE Bot and the platform resources of Baidu AI Studio.
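A Java SDK for quickly integrating AI large-model applications. It brings together large models from multiple platforms, such as OpenAi, Zhipu (ChatGLM), DeepSeek, Moonshot (Kimi), Tencent Hunyuan, 01.AI, and others, and provides unified input and output (aligned with OpenAi) to smooth over their differences. It optimizes function calling (Tool Call) and RAG calls, supports vector databases (Pinecone), includes built-in web-search augmentation, and supports JDK 1.8, giving users a fast way to integrate AI.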