Fabien Poulard's starred repositories
nltk-trainer
Train NLTK objects with zero code
uima-word-tokenizer
A word tokenizer component for UIMA that takes advantage of Unicode general categories. The tokenizer currently handles only French, but it can be extended quite easily.
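
To illustrate the idea behind tokenizing on Unicode general categories, here is a minimal Python sketch. It is not the uima-word-tokenizer implementation (that component targets UIMA), and the function name `tokenize` is hypothetical. It assumes letters, digits, and combining marks belong to words, that punctuation and symbols are emitted as single-character tokens, and that everything else acts as a separator.

```python
import unicodedata


def tokenize(text):
    """Split text into tokens using Unicode general categories.

    Hypothetical helper for illustration only; the rules below are
    assumptions, not the behaviour of any particular tokenizer.
    """
    tokens = []
    current = []  # characters of the word being built
    for ch in text:
        # First letter of the category: L=letter, N=number, M=mark,
        # P=punctuation, S=symbol, Z=separator, C=other.
        major = unicodedata.category(ch)[0]
        if major in ("L", "N", "M"):
            current.append(ch)
            continue
        # Any other character ends the current word.
        if current:
            tokens.append("".join(current))
            current = []
        # Punctuation and symbols become their own tokens;
        # separators and control characters are dropped.
        if major in ("P", "S"):
            tokens.append(ch)
    if current:
        tokens.append("".join(current))
    return tokens
```

Because the rules rely only on character categories, accented French letters such as "é" are treated like any other letter. A language-specific extension would mostly add rules on top, for example deciding whether an apostrophe stays attached to the preceding elided article.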