CodeForAfrica-ARCHIVE / Scrapengine

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Scrapengine

A generic scraping engine that schedules, loads, runs crawlers and outputs to an external data store.

Objectives

  • Self contained pluggable scrapers
  • Scraper execution engine that loads and executes the scrapers at pre-configured intervals
  • Emit output to pre-configured data stores
  • Web dashboard: admin, monitoring, and metrics

alt text

Status

Pre-development

Documentation

About

License:MIT License


Languages

Language:Python 99.6%Language:Shell 0.3%Language:Makefile 0.1%