Ziinc / crawldis

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

app/web: Can create a spider

Ziinc opened this issue · comments

commented

Can CRUD the spider.

What is a spider

Spiders specify a specific page parsing pattern. E.g. extract all urls with this glob pattern, then extract all text with this xpath.

Spiders are made up of a group of selected Parser modules and a configuration for each. They will perform the parsing of new Requests and ParsedItems.

commented

superceded with #4 , modelling requesting with Crawl job instead of Spider