kbussell / pyporterstemmer

python C-extension implementing the Porter Stemming algorithm

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

pyporterstemmer

python C-extension implementing the Porter Stemming algorithm, modified from the C version written by Martin Porter (http://tartarus.org/~martin/PorterStemmer/)

This implementation requires input be unicode strings

Sample Usage

>>> from PorterStemmer import stem
>>> stem(u'running')
u'run'
>>> stem(u"collaboration")
u'collabor'

About

python C-extension implementing the Porter Stemming algorithm


Languages

Language:C++ 89.8%Language:Python 10.2%