cdk-dev / link-scraper

Extract Preview Data from Websites

Home Page: https://cdk.dev

Repository from GitHub: https://github.com/cdk-dev/link-scraper

Content Preview Scraper

This project uses Playwright to extract content previews from a given URL. A preview includes:

  • Generic Metadata from the DOM
  • Open Graph Metadata
  • Twitter Tags Metadata
  • Screenshot (viewport / full)
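
The three metadata groups above can be told apart by the key of each meta tag. The block below is a minimal sketch of that grouping step, not the project's actual code: the names RawMetaTag, PreviewMetadata and groupMetaTags are hypothetical, and the output shape is an assumption. It takes meta tags that have already been read from the page and sorts them into generic, Open Graph and Twitter buckets by key prefix.

```typescript
// Hypothetical shape of one <meta> tag as read from the DOM.
interface RawMetaTag {
  name?: string;     // value of the name attribute, e.g. "description"
  property?: string; // value of the property attribute, e.g. "og:title"
  content?: string;  // value of the content attribute
}

// Hypothetical result shape; the project's real output format may differ.
interface PreviewMetadata {
  generic: Record<string, string>;
  openGraph: Record<string, string>;
  twitter: Record<string, string>;
}

// Sort meta tags into buckets based on the prefix of their key.
// If the same key appears more than once, the last occurrence wins.
function groupMetaTags(tags: RawMetaTag[]): PreviewMetadata {
  const result: PreviewMetadata = { generic: {}, openGraph: {}, twitter: {} };
  for (const tag of tags) {
    const key = (tag.property ?? tag.name ?? "").trim().toLowerCase();
    const value = (tag.content ?? "").trim();
    if (!key || !value) continue; // skip tags without a usable key or content
    if (key.startsWith("og:")) {
      result.openGraph[key.slice("og:".length)] = value;
    } else if (key.startsWith("twitter:")) {
      result.twitter[key.slice("twitter:".length)] = value;
    } else {
      result.generic[key] = value;
    }
  }
  return result;
}

// Example input, made up for illustration.
const example = groupMetaTags([
  { name: "description", content: "A sample page" },
  { property: "og:title", content: "Sample Title" },
  { name: "twitter:card", content: "summary" },
  { name: "viewport" }, // no content, so it is skipped
]);
console.log(JSON.stringify(example, null, 2));
```

In a Playwright script, the raw tags could plausibly be collected with `page.$$eval('meta', ...)`, and the screenshot taken with `page.screenshot({ fullPage: true })` for the full mode or without `fullPage` for the viewport mode. How the project itself wires these together is not shown here.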

Still to do: scrape author data from social media profiles such as Twitter, GitHub, and LinkedIn.

About

License: Apache License 2.0


Languages

TypeScript 83.0%, JavaScript 17.0%