mukeshkdangi / crawler_nypost

Crawling web pages and indexing for solr search

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

nypost_crawler

In this assignment, we worked with a simple web crawler to measure aspects of a crawl, study the characteristics of the crawl, download web pages from the crawl and gather webpage metadata, all from pre-selected news websites.

About

Crawling web pages and indexing for solr search


Languages

Language:Java 100.0%