return-none / stack

Scrapy powered stackoverflow crawler

Stackoverflow crawler

A Scrapy-powered spider that crawls the titles of the 50 newest questions on stackoverflow.com.

How to run

Install the latest version of Scrapy with pip:

pip install scrapy

Clone this repo, cd into the working directory, and start crawling:

scrapy crawl stack

Enjoy the results.

Project structure in brief

stack/items.py: item field definitions.

stack/pipelines.py: pipelines for processing scraped items. Not used in the current version, since the script is very simple and is more of a proof of concept.

stack/setting.py: main settings file.

stack/spiders/stack_spider.py: the spider that does the crawling. The main logic lives here.
