francisyzy / AJP_Assignment2

This java project is a multithreaded web crawler that uses three search engine, Bing, Yahoo, and Google to generate seeds to crawl the website.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

AJP_Assignment2

This java project is a multithreaded web crawler that uses three search engine, Bing, Yahoo, and Google to generate seeds to crawl the website. User will be prompted to enter their search pharase.

Search engines Bing, and Yahoo is crawled via HTTP request where as Google uses Google Custom search engine to get their website seeds.

There will be ten web pages downloaded into the local machine. The files will be overwritten. The webpage download can show the number of occurrence to the search pharase.

This project uses JavaFX as a GUI for the user to interact with. To edit JavaFX quickly, use JavaFX Scene Builder

Assignment Brief and Specs

Please star the project if you find it useful!

Usage

To run this project, clone it in NetBeans and generate google search engine API key here and place it in here at line 27

About

This java project is a multithreaded web crawler that uses three search engine, Bing, Yahoo, and Google to generate seeds to crawl the website.


Languages

Language:Java 100.0%