luiseduardiazc / contactcatch_node

Scraping project

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

contactcatch_node

This project was made for learning purposes. The main idea was scrape comments from facebook posts and store it in csv file.

Installation

Install the dependencies and devDependencies and start the server.

  • Install chromium engine for Puppeteer
$ sudo apt-get install gconf-service libasound2 libatk1.0-0 libatk-bridge2.0-0 libc6 libcairo2 libcups2 libdbus-1-3 libexpat1 libfontconfig1 libgcc1 libgconf-2-4 libgdk-pixbuf2.0-0 libglib2.0-0 libgtk-3-0 libnspr4 libpango-1.0-0 libpangocairo-1.0-0 libstdc++6 libx11-6 libx11-xcb1 libxcb1 libxcomposite1 libxcursor1 libxdamage1 libxext6 libxfixes3 libxi6 libxrandr2 libxrender1 libxss1 libxtst6 ca-certificates fonts-liberation libappindicator1 libnss3 lsb-release xdg-utils wget
  • clone repo
$ git clone https://github.com/luiseduardiazc/contactcatch_node.git
$ cd contactcatch_node
$ npm install

Video Demo!

asciicast

About

Scraping project


Languages

Language:JavaScript 87.7%Language:HTML 12.3%