chattarajoy / Python-web-crawler

Determine the no. of links that must be followed to reach a page starting from seed page.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Python-web-crawler

Determine the no. of links that must be followed to reach a page starting from seed page.

Requirements

  • Python 2.7
  • python packages: requests, bs4

Usage

git clone
cd Python-web-crawler
python solution.py

Enter the seed page's URL and max_depth to start the crawl. Be patient for the results to show up.

About

Determine the no. of links that must be followed to reach a page starting from seed page.


Languages

Language:Python 100.0%