Addono / bookbe-spider

Spider for book titles at book.be.

Home Page: https://gist.github.com/Addono/de7b0633d7faa1da3aeaf1f43985b163

Book.be Scraper

About

Collects book titles from book.be. See this example of a scrape getting the titles of all books released in 2019.

Installation

Create a virtual Python 3 environment.

virtualenv venv

Enable the environment.

source venv/bin/activate

Install all requirements.

pip install -r requirements.txt

Usage

Run the spider.

scrapy runspider spider.py

Or, to save the output gathered by the spider to a file, e.g. as CSV:

scrapy runspider spider.py --output=res.csv -t csv
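Once the export exists, the file can be inspected with Python's standard `csv` module. The following is a minimal sketch, not part of the repository: the helper name `load_rows` is hypothetical, and it makes no assumption about which columns the spider writes, it simply reads whatever header the CSV contains.

```python
import csv
from pathlib import Path


def load_rows(csv_path):
    """Return the exported rows as a list of dicts keyed by the CSV header."""
    # newline="" is what the csv module recommends when opening files.
    with open(csv_path, newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle))


if __name__ == "__main__":
    path = Path("res.csv")  # file name taken from the command above
    if path.exists():
        rows = load_rows(path)
        columns = list(rows[0].keys()) if rows else []
        print(f"{len(rows)} rows, columns: {columns}")
    else:
        print("res.csv not found; run the spider with --output first.")
```

Printing the column names first is a quick way to check which fields the spider actually exported before doing any further processing.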

To filter the collected titles, change the URL in spider.py to include the desired filters.
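If the filters are expressed as query parameters, the filtered URL can be built programmatically rather than edited by hand. The sketch below rests on several assumptions: the helper `with_filters`, the placeholder address `https://example.com/books`, and the parameter names `lang` and `year` are all hypothetical and are not taken from spider.py or from book.be. Check the site's real filter URLs in a browser before adapting it.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def with_filters(base_url, **filters):
    """Return base_url with the given filters merged into its query string."""
    parts = urlsplit(base_url)
    # Keep any parameters already present, then add or override with filters.
    query = dict(parse_qsl(parts.query))
    query.update({key: str(value) for key, value in filters.items()})
    return urlunsplit(parts._replace(query=urlencode(query)))


if __name__ == "__main__":
    # Placeholder URL and hypothetical parameter names, for illustration only.
    url = with_filters("https://example.com/books?lang=nl", year=2019)
    print(url)
```

The resulting string would then be pasted in as, or assigned to, the URL that spider.py starts from.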

Languages

Language: Python 100.0%