nihalmpatel / web-search-engine

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Web Search Engine

Advanced Computing Concepts Group Project - COMP8547-4-R-2019F - University of Windsor

The project is structured in three modules:

  1. react-searchkit: Frontend UI powered by React.js

  2. search-engine-backend: Backend REST API developed using Spring Boot framework which retuns the search result

  3. WebCrawler: Web crawler to crawl through pages

Techniques used:

  • Web crawler with Jsoup
  • Inverted index with Trie
  • Regex
  • Edit distance
  • HashMap
  • Quick Sort

Working Flow:

Working flow diagram

Trie Structure

trie structure example

Search HashMap Structure - Example

example

About


Languages

Language:Java 100.0%