victorgianvechio / xtractorr

Crawler Web Scraping with Puppeteer and IBM Watson Natural Language Understanding

Xtractorr

Crawler Web Scraping with Puppeteer and IBM Watson Natural Language Understanding

All sources are available here

Extract NPM Commands

Scrape all information sources

npm run xtract

Scrape news

npm run xtract-noticia

Scrape events

npm run xtract-evento

Scrape public notices (editais)

npm run xtract-edital

MongoDB

Version 4.0.2

Insert NPM Commands

Insert all information sources

npm run insert

Insert news

npm run insert-noticia

Insert events

npm run insert-evento

Insert public notices (editais)

npm run insert-edital

or

Scrape and insert all information sources

npm run xtract-insert
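
The README lists a combined command only for all sources at once. As a rough sketch, the same scrape-then-insert order could be applied to a single source by chaining the per-source scripts. The helper names `scriptsFor`, `scrapeAndInsert`, and `runNpmScript` are hypothetical and not part of this repository. The only thing taken from the README is the `xtract-<source>` / `insert-<source>` script naming.

```javascript
// Sketch only: chain the per-source npm scripts listed in this README.
// The helpers below are hypothetical, not part of the xtractorr code base.
const { spawnSync } = require('child_process');

// Script suffixes taken from the commands listed above.
const SOURCES = ['noticia', 'evento', 'edital'];

// Build the npm script names for one source,
// e.g. 'noticia' -> ['xtract-noticia', 'insert-noticia'].
function scriptsFor(source) {
  if (!SOURCES.includes(source)) {
    throw new Error(`Unknown source: ${source}`);
  }
  return [`xtract-${source}`, `insert-${source}`];
}

// Run the scrape script, then the insert script, stopping at the first
// non-zero exit status. `run` is injectable so the ordering logic can be
// exercised without actually invoking npm.
function scrapeAndInsert(source, run = runNpmScript) {
  for (const script of scriptsFor(source)) {
    const status = run(script);
    if (status !== 0) return status;
  }
  return 0;
}

// Invoke `npm run <script>` through the shell and return its exit status.
// Script names come only from the fixed SOURCES list above.
function runNpmScript(script) {
  const result = spawnSync(`npm run ${script}`, { stdio: 'inherit', shell: true });
  return result.status === null ? 1 : result.status;
}
```

Under these assumptions, calling `scrapeAndInsert('noticia')` from the project root would run `xtract-noticia` and, only if it succeeds, `insert-noticia`.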

Scraped files can be found in

/result
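
To check what a scrape run produced, something like the following could list the files in the result directory. This is a minimal sketch that assumes `/result` refers to a `result` folder under the project root. The README does not state the file names or format, so the code only lists whatever files are present. `listResultFiles` is a hypothetical helper, not part of the repository.

```javascript
const fs = require('fs');
const path = require('path');

// List regular files in the result directory, sorted by path.
// Assumption: the directory is ./result relative to the current working
// directory unless another path is passed in.
// Returns [] if the directory does not exist yet (nothing scraped so far).
function listResultFiles(resultDir = path.join(process.cwd(), 'result')) {
  if (!fs.existsSync(resultDir)) return [];
  return fs
    .readdirSync(resultDir, { withFileTypes: true })
    .filter((entry) => entry.isFile()) // skip subdirectories
    .map((entry) => path.join(resultDir, entry.name))
    .sort();
}
```

Running `console.log(listResultFiles())` from the project root after `npm run xtract` would then print the paths of the scraped files.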

Copyright ® 2019 Victor Gianvechio

License: MIT License

Languages

JavaScript 48.2%, HTML 41.4%, CSS 10.4%