Aaqib Ahmed Nazir (aaqib-ahmed-nazir)

aaqib-ahmed-nazir

Geek Repo

Location:Pakistan

Github PK Tool:Github PK Tool

Aaqib Ahmed Nazir's repositories

BDA_Assignment02

This repository aims to develop a basic search engine utilizing Hadoop's MapReduce framework to index and process extensive text corpora efficiently. The dataset used for this project is a subset of the English Wikipedia dump, totaling 5.2 GB in size. The project focuses on implementing a naive search algorithm to address challenges in information.

Language:Jupyter NotebookStargazers:1Issues:1Issues:0