ParzivalHack / T-Crawl

A simple web crawler (spider) written in Python

What is Web Crawling?

A web crawler, sometimes called a spider or spiderbot and often shortened to crawler, is an Internet bot that systematically browses websites for the purpose of web indexing (web spidering).

Tool info

T-Crawl is a web crawler (spider) written in Python. This is how it works:

  • It downloads the HTML from a webpage.
  • It parses the HTML to extract links.
  • It prints the links collected.
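
The three steps above can be sketched with the Python standard library alone. This is a minimal illustration, not T-Crawl's actual source, which is not shown here: the names `LinkExtractor`, `extract_links`, `fetch_html`, and `crawl_page` are hypothetical, and the sketch assumes only `<a href="...">` links are of interest.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin
from urllib.request import urlopen


class LinkExtractor(HTMLParser):
    """Collects the href of every <a> tag, resolved against a base URL."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Turn relative links such as "/about" into absolute URLs.
                self.links.append(urljoin(self.base_url, value))


def extract_links(html, base_url):
    """Step 2: parse the HTML and return the links it contains."""
    parser = LinkExtractor(base_url)
    parser.feed(html)
    return parser.links


def fetch_html(url, timeout=10):
    """Step 1: download the HTML of a page as text."""
    with urlopen(url, timeout=timeout) as response:
        # Fall back to UTF-8 when the server does not declare a charset.
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


def crawl_page(url):
    """Steps 1 to 3: download, extract, and print the links of one page."""
    links = extract_links(fetch_html(url), url)
    for link in links:
        print(link)
    return links
```

Calling `crawl_page("https://example.com")` would fetch that page and print each link it finds. Keeping the parsing in `extract_links`, separate from the download, makes it possible to check the link extraction on a plain HTML string without network access.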

Installation of T-Crawl

Usage

  • cd T-Crawl
  • chmod +x T-Crawl.py
  • python T-Crawl.py

License

This tool is released under the GNU General Public License v3.0 (GPLv3).

© 2022 Tommaso Bona

About

License: GNU General Public License v3.0

Languages

Language: Python 100.0%