There are 5 repositories under document-search topic.
Building blocks for rapid development of GenAI applications
🥥 Coco AI App - Search, Connect, Collaborate, Personal AI Search and Assistant, all in one space.
A lightweight, production-ready RAG (Retrieval Augmented Generation) library in Go.
Vector search demo with the arXiv paper dataset, RedisVL, HuggingFace, OpenAI, Cohere, FastAPI, React, and Redis.
This open source chatbot project lets you create a chatbot that uses your own data to answer questions, thanks to the power of the OpenAI GPT-3.5 model.
🥥 Coco AI Server - Search, Connect, Collaborate, AI-powered Enterprise Search, all in one space.
BM25 search implemented in PL/pgSQL
Finding all pairs of similar documents time- and memory-efficiently
Document Search Engine project with TF-IDF abd Google universal sentence encoder model
faster BM25 search algorithms in Python
PostgreSQL-native semantic search engine with multi-modal capabilities. Add AI-powered search to your existing database without separate vector databases, vendor fees, or complex setup. Features text + image search using CLIP embeddings, native SQL joins, and 10-minute Docker deployment.
This code example shows how to make a chatbot for semantic search over documents using Streamlit, LangChain, and various vector databases. The chatbot lets users ask questions and get answers from a document collection. The code is in Python and can be customized for different scenarios and data.
AI-powered file launcher and semantic search assistant. Like Spotlight/Alfred but with advanced AI capabilities for understanding context and meaning. Features local processing, privacy-first design, and seamless integration with your workflow.
Chat with your PDFs using AI! This Streamlit app uses RAG, LangChain, FAISS, and OpenAI to let you ask questions and get answers with page and file references.
Database Freedom Platform - Mathematical search optimization for whatever database you already have. 27,000x faster than vector databases with SMT-powered search across 8+ database types. One-time 9-2999 vs 00-500/month recurring.
A highly efficient, isomorphic, full-featured, multilingual text search engine library, providing full-text search, fuzzy matching, phonetic scoring, document indexing and more, with micro JSON state hydration/dehydration in-browser and server-side.
Chat with your PDFs using AI! This Streamlit app uses RAG, LangChain, FAISS, and OpenAI to let you ask questions and get answers with page and file references.
SmartRAG is a terminal-based RAG system using LangGraph. It processes queries by retrieving relevant content from markdown or PDFs, then responds using OpenAI GPT. Supports webpage-to-PDF conversion, vector DB search, and modular flow control.
Rust-based text search engine from scratch supporting multiple document similarity metrics (TF-IDF, BM25, BM25VA)
This module aims to make documents searchable for customers in Magento 2.
Node.js bindings for Tantivy. Provides indexing, querying, and advanced search features with TypeScript support.
Distributed document search using TF-IDF algorithm.
AI-powered hybrid search engine combining keyword, vector, and LLM-based contextual search using RAG with support for AI21, OpenAI or any other LLM.
NLP Course By Deep learning.io powered by @coursera. Taught by: Younes Bensouda Mourri, Instructor of AI at Stanford University and Łukasz Kaiser, Staff Research Scientist at Google Brain.
Semantik is a self-hosted semantic search engine for your documents.
This module aims to make documents searchable with product keywords in Magento 2.
The extended version of simhash supports fingerprint extraction of documents and images.
Semantic document search system with pgvector and PGAI