okdistribute / nutella-scrape

:chocolate_bar: learn to scrape the web with Node.js -- it tastes like chocolate

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

nutella-scrape

NPM

nutella

  1. Run sudo npm install nutella-scrape -g
  2. Run nutella-scrape
  3. ???
  4. LEARN!!

In this tutorial, we will work through how to scrape websites using Node.js for the primary purpose of using it in other programs -- in servers, frontends (yes, Node works in the browser!), or just writing a table to disk for analysis elsewhere.

The DOM (Document Object Model) is an abstract concept describing how we can interact with HTML. JavaScript is GREAT for traversing HTML (i.e., the DOM) because it was made to work with HTML in the first place.

TODO

  • parallel
  • spoofing
  • cookies/login walls
  • electron-microscope

About

:chocolate_bar: learn to scrape the web with Node.js -- it tastes like chocolate


Languages

Language:JavaScript 100.0%