aaqib-ahmed-nazir / BDA_Assignment02

This repository aims to develop a basic search engine utilizing Hadoop's MapReduce framework to index and process extensive text corpora efficiently. The dataset used for this project is a subset of the English Wikipedia dump, totaling 5.2 GB in size. The project focuses on implementing a naive search algorithm to address challenges in information.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

aaqib-ahmed-nazir/BDA_Assignment02 Stargazers