rushitjasani / Wikipedia-Search-Engine

A complete search engine experience built on top of 75 GB Wikipedia corpus with subsecond latency for searches. Results contain wiki pages ordered by TF/IDF relevance based on given search word/s. From an optimized code to the K-Way mergesort algorithm, this project addresses latency, indexing, and big data challenges.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

rushitjasani/Wikipedia-Search-Engine Stargazers