There are 1 repository under query-processing topic.
A composable and fully extensible C++ execution engine library for data management systems.
Leverages extensive power of multiple Machine Learning algorithms & LLM to provide in-depth answers to medical queries and predicts condition/diseases based on patient symptoms
Codes for our ADMS'24 paper "Bandwidth Expansion via CXL: A Pathway to Accelerating In-Memory Analytical Processing".
A high-performance search engine that crawls web pages, indexes content, calculates PageRank scores, and provides modern UI for searching.
A DBMS project on Textile Store Management using StreamLit-Python for the frontend app
Efficient caching for non-volatile memory-backed search indices. A thesis submitted for undergraduate honours for the degree Software Engineering.
Source code for the paper "k-Distance Approximation for Memory-Efficient RkNN Retrieval"
A Python library for intelligent file filtering using SQL expressions and metadata-based scan planning. This library enables efficient data lake query optimization by determining which files need to be scanned based on their statistical metadata.
An efficient layer to perform query processing on top of Terrier
Enhance the performance and cost-efficiency of large-scale Retrieval Augmented Generation (RAG) applications. Learn to integrate vector search with traditional database operations and apply techniques like prefiltering, postfiltering, projection, and prompt compression.
Skript zur Vorlesung Datenbanksysteme II | Lecture notes for Database Systems II
Labs for Databases II
Simple search system that includes inverted index builder and boolean query processor for information retrieval.
This repo consists of all the assignments, projects, tasks of Information Retrieval course of FAST NUCES Spring 2023.
Data Processing At Scale
C++ implementations of relational operators, inverted-index-based containment and relevance keyword queries, and bulk-loaded R-Tree spatial indexing.
A search engine that ranks documents by relevance to a query using a weighting scheme, tokenization, stop word removal, and stemming
Database Systems
Inverted index and Positional index for a set of collection to facilitate Boolean Model of IR. Inverted files and Positional files are the primary data structure to support the efficient determination of which documents contain specified terms and at which proximity.
Falcon: The world fastest data analytics engine
Instructional materials (course files) for the BBT3104 course (Advanced Database Systems) and the MIT8107 course (Advanced Database Systems). Topic: Query Optimization. Based on the IMDb dataset using PostgreSQL and the Join Order Benchmark (JOB).
Tunable Query Optimizer for Web APIs and User Preferences
Skript zur Vorlesung Grundlagen der Datenbanksysteme | Lecture notes on the fundamentals of database systems
🪂 Parachute: Single-Pass Bi-Directional Information Passing (VLDB'25)
Text preprocessing, indexer constructions, and search engines implementation for information retrieval. Performance analysis done by measuring the construction time of indexers.
what if I had to make a datalog in a cabin with no internet
Search Engine for Information Retrieval Coursework
Practices on Inverted Index algorithm.
HeapQuery is an in-memory database system in Java for managing schemas and tuples, supporting basic data manipulation, relational operations, and boolean query conditions.
It simulates a simple search engine using an inverted index to efficiently retrieve documents based on user queries. It supports must-have words, optional words, and exclusion filters, enabling flexible and precise search results. The system processes multiple documents, indexes their content,and allows users to search using logical query operators
A lightweight database engine supporting CRUD, simple queries, and concurrency control with a CLI.
A library to support distributed matrix computation for machine learning and data analysis
Implementation and artifacts for "User-Defined Operators: Efficiently Integrating Custom Algorithms into Modern Databases"