dirtystylus / webscraping-nodejs

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

There’s a few files here, and a lot of hard-coded stuff, so YMMV.

  • prep_books.js: I used this to set up a list of URLs pointing to the site I wanted to scrape
  • book_queries.json: The results of prep_books.js get written to this file
  • crawl_books.json: This walks through the list of URLs, grabs the data, parses it, then makes a second request for the book image and saves that to disk

About


Languages

Language:JavaScript 100.0%