Alphx-rgb / Web-Crawler-CLI-Tool

A CLI Tool for Web Crawling and Scrapping

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Web-Crawler

Project of WOC

  • This project is about a web crawler,a tool which extracts information about webpages.
  • I will use Python language for the project.
  • This Crawler crawls over the internet and stores links,images and screenshots of linkavailable onthe website.
  • for further help: use command "python WCSC.py man" or "python WCSC.py -help"

Modules/libraries used:

  • tldextract

  • selenium

  • os

  • bs4

  • requests

  • sys

  • termcolor

  • itertools

  • keyboard

  • time

  • re Below are some Snippets of working of the tool:

  • man_page man_page

  • Headers Headers

Note: Enter email : https://github.com and depth : 1 or greater than 1

About

A CLI Tool for Web Crawling and Scrapping


Languages

Language:Python 100.0%