mukeshkdangi / nypost_searchengine

Crawled and stored metadata of web pages using multithreaded crawler. Used GCP Hadoop cluster to create inverted index. Developed custom page rank algorithm and exposed RESTful APIs with spellchecker and autocomplete features.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

mukeshkdangi/nypost_searchengine Issues

No issues in this repository yet.