Book.be Scraper

About

Collects book titles from book.be. See this example of a scrape getting the titles of all books released in 2019.

Create a virtual Python 3 environment.

virtualenv venv

Enable the environment.

source venv/bin/activate

Install all requirements.

pip install -r requirements.txt

Run the spider.

scrapy runspider spider.py

Or if you want to save the output gathered by the spider, e.g. as CSV:

scrapy runspider spider.py --output=res.csv -t csv

If you want to filter the collected titles, then change URL in spider.py to include the desired filters.

Spider for book titles at book.be.

Language:Python 100.0%