bact / RDRPOSTagger

A fast and accurate POS and morphological tagging toolkit (EACL 2014)

Home Page:http://rdrpostagger.sourceforge.net

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

RDRPOSTagger

Python 3.4 License: GPL v3

enrdr3

RDRPOSTagger is a robust and easy-to-use toolkit for POS and morphological tagging. It employs an error-driven approach to automatically construct tagging rules in the form of a binary tree.

  • RDRPOSTagger obtains very fast tagging speed and achieves a competitive accuracy in comparison to the state-of-the-art results. See experimental results including performance speed and tagging accuracy for 13 languages in our AI Communications article.

  • RDRPOSTagger now supports pre-trained UPOS, XPOS and morphological tagging models for about 80 languages. See folder Models for more details.

  • See USAGE.md for full details on how to train and use the tagger.

Architecture

The general architecture and experimental results of RDRPOSTagger can be found in our following papers:

Please CITE either the EACL or the AICom paper whenever RDRPOSTagger is used to produce published results or incorporated into other software.

Distribution

Current release (41MB .zip file containing about 330 pre-trained tagging models) is available to download at: https://github.com/datquocnguyen/RDRPOSTagger/archive/master.zip

Find more information about RDRPOSTagger at: http://rdrpostagger.sourceforge.net/

In addition, you might want to try my neural network-based toolkit jPTDP for joint POS tagging and dependency parsing.

About

A fast and accurate POS and morphological tagging toolkit (EACL 2014)

http://rdrpostagger.sourceforge.net

License:GNU General Public License v3.0


Languages

Language:HTML 46.5%Language:Python 38.5%Language:Java 15.0%