Simple app for scrapping data from gumtree.
The project was created for learning purposes to know how to combine scrapy framework with TorIp changer.
- Docker desktop
.
βββ docker-compose.yml
βββ LICENSE
βββ README.md
βββ src
βββ crawler
β βββ __init__.py
β βββ items.py
β βββ middlewares.py
β βββ pipelines.py
β βββ settings.py
β βββ spiders
β βββ __init__.py
β βββ mieszkania2.py
β βββ quotes_spider.py
βββ Dockerfile
βββ go_spider.py
βββ scrapy.cfg
βββ tests
βββ ipchanger_works.py
Clone repository:
git clone https://github.com/Santhin/TorScrapy.git
To run the crawler type:
docker-compose up
Simple check if tor ip changer is working unmark commented test in dockerfile.
The exemplary output:
- add control startup for TorIpChanger container in docker-compose
- Scrapy - Crawler
- TorIpChanger - Privoxy + Tor
- Hat tip to DusanMadar for amazing framework and tutorial step by step https://github.com/DusanMadar/TorIpChanger https://gist.github.com/DusanMadar/8d11026b7ce0bce6a67f7dd87b999f6b