There are 5 repositories under document-search topic.
Vector search demo with the arXiv paper dataset, RedisVL, HuggingFace, OpenAI, Cohere, FastAPI, React, and Redis.
This open source chatbot project lets you create a chatbot that uses your own data to answer questions, thanks to the power of the OpenAI GPT-3.5 model.
Finding all pairs of similar documents time- and memory-efficiently
Document Search Engine project with TF-IDF abd Google universal sentence encoder model
This code example shows how to make a chatbot for semantic search over documents using Streamlit, LangChain, and various vector databases. The chatbot lets users ask questions and get answers from a document collection. The code is in Python and can be customized for different scenarios and data.
A highly efficient, isomorphic, full-featured, multilingual text search engine library, providing full-text search, fuzzy matching, phonetic scoring, document indexing and more, with micro JSON state hydration/dehydration in-browser and server-side.
Rust-based text search engine from scratch supporting multiple document similarity metrics (TF-IDF, BM25, BM25VA)
This module aims to make documents searchable for customers in Magento 2.
NLP Course By Deep learning.io powered by @coursera. Taught by: Younes Bensouda Mourri, Instructor of AI at Stanford University and Łukasz Kaiser, Staff Research Scientist at Google Brain.
This module aims to make documents searchable with product keywords in Magento 2.
COVID-19 comorbidities analysis platform based on Natural Language Processing(NLP)
Distributed document search using TF-IDF algorithm.
Mini desktop search engine with Binary Search Tree
Open Source Search Engine with built-in web/document crawler and an indexing method.
Apache Solr Document Search and Indexing Analysis with OCR
Given a set of PDFs and the query, the most relevant pdf can be found with the help of TF-IDF. The code has not used any library to implement TF-IDF
Website in PHP to index all pdf content and easy way to find any text
Information retrieval of text document using TF-IDF weighting & Cosine Similarity Algorithm.