refs / crawlerb

Simple web crawler. It finds scripts running on websites 🔎

What?

Ruby implementation of your average web crawler. Nothing fancy.

TODO

  • get all the anchors on a page and add them to the `to_visit` list if they belong to the same domain.
  • error handling: 404s and the like.
  • sanitize: remove empty links, recognize which type of link we encountered.
  • implement the main loop of the crawler
  • decide on stop conditions
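
The TODO items above could fit together roughly as in the sketch below. This is not the repository's code: every method name (`extract_links`, `sanitize`, `fetch_page`, `crawl`) and the `max_pages` and `fetcher` parameters are assumptions made for illustration. It uses only the Ruby standard library, so anchors are found with a simple regex (a real HTML parser would be more robust), and the stop conditions assumed here are "queue empty" or "page limit reached".

```ruby
require "net/http"
require "set"
require "uri"

# Hypothetical helper: collect raw href values from anchor tags.
# A regex is a simplification; it will miss unquoted or unusual markup.
def extract_links(html)
  html.scan(/<a\s[^>]*href\s*=\s*["']([^"']*)["']/i).flatten
end

# Hypothetical helper: turn a raw href into an absolute http(s) URL,
# or return nil for empty links, bare fragments, and other link types
# (mailto:, javascript:, ...).
def sanitize(href, base)
  href = href.strip
  return nil if href.empty? || href.start_with?("#")

  uri = URI.join(base, href)
  return nil unless uri.is_a?(URI::HTTP) # URI::HTTPS is a subclass of URI::HTTP

  uri.fragment = nil # "/page#top" and "/page" are the same page
  uri.to_s
rescue URI::Error
  nil
end

# Hypothetical helper: return the body on a 2xx response, nil otherwise
# (404s, network errors, timeouts). Redirects are not followed here.
def fetch_page(url)
  response = Net::HTTP.get_response(URI(url))
  response.is_a?(Net::HTTPSuccess) ? response.body : nil
rescue StandardError
  nil
end

# Main loop: breadth-first over same-host links.
# Assumed stop conditions: nothing left in to_visit, or max_pages reached.
def crawl(start_url, max_pages: 50, fetcher: method(:fetch_page))
  host = URI(start_url).host
  to_visit = [start_url]
  visited = Set.new

  until to_visit.empty? || visited.size >= max_pages
    url = to_visit.shift
    next if visited.include?(url)

    visited << url
    html = fetcher.call(url)
    next if html.nil? # fetch failed; nothing to extract

    extract_links(html).each do |href|
      link = sanitize(href, url)
      next if link.nil? || URI(link).host != host

      to_visit << link unless visited.include?(link)
    end
  end

  visited.to_a
end
```

Calling `crawl("https://example.com/", max_pages: 10)` would return the list of URLs the crawler attempted, in visit order. The `fetcher` argument accepts any callable that maps a URL to an HTML string or `nil`, which makes it possible to try the loop against canned pages without touching the network.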

Languages

Ruby 100.0%