vambati / pypln

Pipeline for distributed Natural Language Processing, made in Python

Home Page:http://namd.github.com/pypln

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

PyPLN

PyPLN is a distributed pipeline for natural language processing, made in Python. We use NLTK and ZeroMQ as our foundations. The goal of the project is to create an easy way to use NLTK for processing big corpora, with a Web interface.

We don't have a production release yet, but it's scheduled on our next milestone.

Documentation

Our documentation is hosted using GitHub Pages:

Requirements

To install dependencies (on a Debian-like GNU/Linux distribution):

sudo apt-get install python-setuptools
pip install virtualenv virtualenvwrapper
mkvirtualenv pypln
pip install -r requirements.txt

Developing

To run tests:

workon pypln
pip install nose
make test

See our code guidelines.

About

Pipeline for distributed Natural Language Processing, made in Python

http://namd.github.com/pypln