piobig2871 / web_crawler

Creating web crawler for web pages

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Global App Testing - dev task

Goal of project

The goal of this project is to prepare a web crawler for any page you want and map it in python code. There is plenty of solution for crawling between the pages but I wanted prepatr some unique approach of this problem in the easy way without complicated structures and unecessary features. Idea for this project was from one of the interview tasks Global App Testing dev task - senior python developer. I hope you will enjoy this project, feel free to contribute and create issues - all of us want a good quality code and everybody are allowedd to make mistakes.

Setup


  • Python Versions Tested

cd ~
git clone https://github.com/piobig2871/web_crawler.git
cd ./web_crawler
python3 -m venv venv
. venv/bin/activate
python -m pip install -r requierements.txt

Warnings!

  1. If you want to check some web page always go to your_website.com/robots.txt to find out if this is legal

About

Creating web crawler for web pages

License:MIT License


Languages

Language:Python 100.0%