heidihoge / cookbot

minimal crawlers for scraping recipes from various recipe sites

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

cookbot

Minimal crawlers for various recipe websites. It can be used for crawling:

TODO:

Requirements

pip install scrapy

Usage

scrapy crawl cookpad --output=cookpad.json
scrapy crawl allrecipes --output=allrecipes.json

If you want to pause and resume crawls, run it like this:

scrapy crawl cookpad -s JOBDIR=cpad-crawl --output=cookpad.json

You can stop the spider safely any time (by pressing Ctrl-C), and resume it later by issuing the same command.

About

minimal crawlers for scraping recipes from various recipe sites

License:MIT License


Languages

Language:Python 100.0%