dianedef / wswp_places

A testing website that enables non-blocking, high-speed scraping

This repository contains the source code for the example website used throughout the book Web Scraping with Python, published by Packt Publishing. When run locally, the app will not block IPs that download faster than the thresholds specified in models/3_cache.db, so you can test your crawler at full speed.

Install

This app relies on the web2py framework; downloads and documentation are available from http://www.web2py.com.

To install, run the following commands in a shell:

#!bash

    # first download web2py
    wget http://www.web2py.com/examples/static/web2py_src.zip
    unzip web2py_src.zip
    # now download the app
    cd web2py/applications
    git clone git@github.com:richardpenman/wswp_places.git places
    # now start the web2py server with a password for the admin interface
    cd ..
    python web2py.py --password=<password>

The places app can now be accessed in your web browser at http://127.0.0.1:8000/places.
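
Before pointing a crawler at the local site, it can help to confirm that the server is answering. The following is a minimal sketch, assuming Python 3 and the default address shown above; the helper name `is_reachable` and the timeout value are illustrative and not part of this repository.

```python
import urllib.error
import urllib.request

# Assumption: the web2py server from the install steps is running locally
# on the default host and port.
PLACES_URL = "http://127.0.0.1:8000/places"


def is_reachable(url, timeout=5.0):
    """Return True if `url` answers with HTTP 200, False otherwise.

    Hypothetical helper for illustration only.
    """
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status == 200
    except (urllib.error.URLError, OSError):
        # Covers connection refused, timeouts, and HTTP error statuses.
        return False


if __name__ == "__main__":
    print("places app reachable:", is_reachable(PLACES_URL))
```

If this prints `False`, check that `python web2py.py` is still running and that the app was cloned into `web2py/applications/places`.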
