reformatco / jimdo-scraper

Test work on using node to scrape blog from a jimdo site

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Jimdo scraper using Node.js

Initial experiment using cheerio and sanitize-html to scrape a Jimdo blog enabling you to export content to another CMS.

To do

  • scrape pagination
  • scrape each pages within the blog
  • save to json file of all blog posts with links, date, title etc
  • iterate through each blog post saving data into
    • single json file
    • separate json files

into wp

  • create post with title
  • make first image element the featured image
  • import

About

Test work on using node to scrape blog from a jimdo site


Languages

Language:JavaScript 100.0%