ZeyadTarekk / Dawrly

Crawler based search engine that demonstrates the main features of a search engine (web crawling, indexing and ranking) and the interaction between them.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

GitHub contributors GitHub issues GitHub license GitHub forks GitHub stars GitHub Language

📝 Table of Contents

📙 About

  • Crawler based search engine that demonstrates the main features of a search engine (web crawling, indexing and ranking) and the interaction between them.

📃 Documentation

  • index.html file in documentation folder

🏁 Getting Started

This is an list of needed instructions to set up your project locally, to get a local copy up and running follow these instructuins.

Prerequisite

  1. Apache Tomcat 9
  2. MongoDB
  3. Any Java IDE (Intellij in our case)

Installation

  1. Clone the repository
$ git clone https://github.com/ZeyadTarekk/Dawrly.git
  1. Right click on the folder and open it with the IDE

Running

Running program

Run Crawler_Indexer_Main to start the Crawler then the Indexer
unzip "apache-tomcat-9.0.62.zip"
Copy the content of Interface file to "apache-tomcat-9.0.62\webapps\SearchEngine"
Copy "stopwords.txt" and "bodyFiles" to "apache-tomcat-9.0.62\bin"
Run main function in RunGUI class

💻 Built Using

  • Java

📷 Demo Screenshots

Main Screen

Suggestion Mechanism

Results

Pagination of results

Phrase Searching

Contributors

Zeyad Tarek
Zeyad Tarek
Abdelrahman Mohamed
Abdelrahman Mohamed
Beshoy Morad
Beshoy Morad
Ziad Sherif
Ziad Sherif

License

This software is licensed under MIT License, See License for more information ©ZeyadTarekk.

About

Crawler based search engine that demonstrates the main features of a search engine (web crawling, indexing and ranking) and the interaction between them.

License:MIT License


Languages

Language:Java 62.7%Language:HTML 25.3%Language:JavaScript 9.0%Language:CSS 3.0%