momtr / web-scraping

🚀📈 web scraping with cron jobs (every minute) using puppeteer

web-scraping

🚀 web scraping with cron jobs (every minute) using puppeteer

The cron job runs every minute and saves the configured website as a PDF file.
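
The flow described above could look roughly like the sketch below. It is not the repo's actual source: the helper names (buildPdfPath, savePageAsPdf, startJob), the timestamped file name, and the A4 / networkidle2 options are assumptions made for illustration. Only the use of puppeteer and node-cron and the config field names come from this README.

```javascript
const path = require('path');

// Hypothetical helper: builds a unique output path so that each run
// writes a new file instead of overwriting the previous one.
function buildPdfPath(dir, name, date = new Date()) {
    // ISO timestamps contain ':' and '.', which are awkward in file names
    const stamp = date.toISOString().replace(/[:.]/g, '-');
    return path.join(dir, `${name}_${stamp}.pdf`);
}

// Opens the page in headless Chromium and writes it to disk as a PDF.
async function savePageAsPdf(pageUrl, outPath) {
    // Loaded lazily so buildPdfPath can be used without puppeteer installed
    const puppeteer = require('puppeteer');
    const browser = await puppeteer.launch();
    try {
        const page = await browser.newPage();
        await page.goto(pageUrl, { waitUntil: 'networkidle2' });
        await page.pdf({ path: outPath, format: 'A4' });
    } finally {
        // Always close the browser, even if navigation or rendering fails
        await browser.close();
    }
}

// Schedules the job; '* * * * *' means "every minute" in cron syntax.
function startJob({ page_url, dir, name, cron: expression }) {
    const cron = require('node-cron');
    const fs = require('fs');
    fs.mkdirSync(dir, { recursive: true }); // make sure the output directory exists
    return cron.schedule(expression, () => {
        savePageAsPdf(page_url, buildPdfPath(dir, name)).catch(console.error);
    });
}
```

Calling startJob(config) with the object from config.js would then schedule the job. Launching a fresh browser on every run keeps the sketch simple; a long-running service might prefer to reuse one browser instance.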

NPM Packages

  • puppeteer (for web scraping)
  • express (for server)
  • node-cron (for cron jobs)

Install

Clone this repo:

$ git clone https://github.com/moritzmitterdorfer/web-scraping.git

Install NPM packages:

$ npm install

Configuration: Open the config.js file and specify:

const config = {
    page_url: 'PAGE_URL',
    dir: 'savedData',
    name: 'FILE_NAME_BASIC',
    cron: 'CRON_CONDITION {example: * * * * *}'
};
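
For illustration, a filled-in config might look like the following. The URL and file name are placeholder values, and checkConfig is a hypothetical helper that is not part of this repo; it only catches obvious mistakes such as leaving the template placeholders in place. The 5-or-6-field rule assumes node-cron's optional leading seconds field.

```javascript
// Example values only; replace them with your own.
const config = {
    page_url: 'https://example.com', // page to capture
    dir: 'savedData',                // output directory
    name: 'example',                 // base name of the saved files
    cron: '* * * * *'                // every minute
};

// Hypothetical sanity check: returns a list of problems (empty if none).
function checkConfig(cfg) {
    const problems = [];
    try {
        new URL(cfg.page_url); // throws on strings that are not absolute URLs
    } catch {
        problems.push('page_url is not a valid URL');
    }
    if (!cfg.dir) problems.push('dir is empty');
    if (!cfg.name) problems.push('name is empty');
    // Only counts fields; it does not validate the values inside each field
    const fields = String(cfg.cron).trim().split(/\s+/).length;
    if (fields !== 5 && fields !== 6) problems.push('cron must have 5 or 6 fields');
    return problems;
}

console.log(checkConfig(config));
```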

Now, you can start the service:

$ npm start

License: MIT License


Languages

Language:JavaScript 100.0%