mpboddie / crawler

Just trying out some web crawling.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

crawler

Just trying out some web crawling.

  1. Interact with website to obtain parsed results
  2. Determine which results show be acted upon
  3. Download file, upload it to other service, delete downloaded file
  4. Check for completed files from other service and copy to server in appropriate location
  5. Profit?

Prerequisites

Python 3.x (3.5.3 was used as of this writing) PhantomJS (2.1.1 was used)

Run it

  1. Configure it by customizing config.json and search.json (they should be self explanatory, if you don't understand them then this probably isn't the program you are looking for)
  2. Run main.py using Python 3

About

Just trying out some web crawling.


Languages

Language:Python 100.0%