denalimarsh / Search-Engine

A scalable search engine which indexes the web and returns appropriate results.

Search Engine

This is the backbone of a scalable, optimized search engine written in Java. It indexes large word files using a custom InvertedIndexBuilder class. The search engine can also connect to the internet and crawl the web: it indexes the cleaned HTML text of each page, then recursively crawls the links found on that page. It was originally developed at the University of San Francisco as the final project for CS212 Software Development.
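
To illustrate the idea behind an inverted index, here is a minimal sketch. It is not the project's InvertedIndexBuilder: the class name `SimpleInvertedIndex`, its methods, and the tokenization rule (lowercase, split on non-alphanumeric characters) are all assumptions made for this example. The structure maps each word to the locations where it occurs and, for each location, the 1-based positions of the word.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Locale;
import java.util.Set;
import java.util.TreeMap;
import java.util.TreeSet;

// Hypothetical illustration only; not the repository's InvertedIndexBuilder.
class SimpleInvertedIndex {
    // word -> (location -> sorted positions of the word within that location)
    private final TreeMap<String, TreeMap<String, TreeSet<Integer>>> index = new TreeMap<>();

    // Tokenizes the text and records every word with its 1-based position.
    public void add(String location, String text) {
        int position = 0;
        for (String token : text.toLowerCase(Locale.ROOT).split("[^\\p{Alnum}]+")) {
            if (token.isEmpty()) {
                continue; // split() yields an empty first token when text starts with a separator
            }
            position++;
            index.computeIfAbsent(token, k -> new TreeMap<>())
                 .computeIfAbsent(location, k -> new TreeSet<>())
                 .add(position);
        }
    }

    // Returns the locations containing the word, in sorted order.
    public Set<String> locations(String word) {
        TreeMap<String, TreeSet<Integer>> hits = index.get(word.toLowerCase(Locale.ROOT));
        if (hits == null) {
            return Collections.emptySet();
        }
        return Collections.unmodifiableSet(hits.keySet());
    }

    // Returns the positions of the word within one location, in ascending order.
    public List<Integer> positions(String word, String location) {
        TreeMap<String, TreeSet<Integer>> hits = index.get(word.toLowerCase(Locale.ROOT));
        if (hits == null || !hits.containsKey(location)) {
            return Collections.emptyList();
        }
        return new ArrayList<>(hits.get(location));
    }

    public static void main(String[] args) {
        SimpleInvertedIndex idx = new SimpleInvertedIndex();
        idx.add("a.txt", "Hello world, hello!");
        idx.add("b.txt", "World peace");
        System.out.println(idx.locations("world"));          // [a.txt, b.txt]
        System.out.println(idx.positions("hello", "a.txt")); // [1, 3]
    }
}
```

Sorted maps and sets keep lookups deterministic, which makes results easy to compare in tests. A crawler could reuse the same `add` call by passing a page URL as the location and the cleaned HTML text as the content.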

Languages

Java 100.0%