WSGI Sandbox

This project contains some WSGI deployment demonstrations using Python 2.7.3.

Motivation:

Deploy standalone WSGI applications with a solid httpd proxy in front.
Support gracefully reloading application code without dropping any requests.
Configure WSGI applications with production loggers.
Have all loggers go to a single destination, per application. (Generally, have logging configuration you can actually understand.)
Rotate log files with a simple configuration.
Keep all production configuration in obvious places.

Demonstration:

Set up gunicorn with nginx proxying in front.
Run gunicorn via supervisord, gracefully reload with pid from supervisorctl.
Configure Python's standard library logging to each WSGI application.
Use stderr logging handlers, pass through gunicorn, capture with supervisor.
Rotate log files with supervisord.
Put all production configuration in supervisord.conf.

Questions for the DevOps team, as many are policy related:

How do we manage sites with nginx, supervisord, and gunicorn configurations?
How do we lock down supervisorctl but still let deploy script get the pid?
How should we standardize logger formats?
How do we keep coherent log output given concurrency?
Do we care that this setup does not compress log backups?
Where does gunicorn configuration belong?

Install Python dependencies in a virtualenv:

pip install -r requirements.txt

Kick everything off by starting supervisor, which launches gunicorn processes:

supervisord

Proxy via nginx, separate from the system-wide httpd for the purpose of this demonstration; this may require sudo depending on how nginx is configured (compare configurations with nginx -V):

nginx -p . -c nginx.conf

Generate a lot of concurrent requests in order to see some action:

ab -c 10 -n 10000 http://flask.local.willowtreeapps.com:8000/
ab -c 10 -n 10000 http://django.local.willowtreeapps.com:8000/
ab -c 10 -n 10000 http://wsgi.local.willowtreeapps.com:8000/

This generates 10000 requests with a concurrency level of 10, using ab (ApacheBench). Docs are at http://httpd.apache.org/docs/2.4/programs/ab.html.

On Ubuntu:

sudo apt-get install apache2-utils

To gracefully reload gunicorn:

kill -HUP `supervisorctl pid flask_instance`
kill -HUP `supervisorctl pid django_instance`
kill -HUP `supervisorctl pid wsgi_instance`

Prove that graceful reloading does not drop requests by issuing a HUP signal to gunicorn while under heavy load of ab. "Failed requests" according to ab includes responses which have inconsistency of content length -- the test is to look for Non-2XX responses in ab's output.

A note on following the logs. supervisorctl provides a useful tail command, which supports tail -f. However, it's buffered by default. If you want to see some action, instead of e.g. supervisorctl tail -f flask_instance, use tail -f flask_instance.log directly.

rduplain / wsgi-sandbox

WSGI Sandbox

About