jantrienes / HanTa

The Hanover Tagger - A simple approach to lemmatization and POS-tagging of German morphology based on heuristics and hidden markov models

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

HanTa

The Hanover Tagger - A simple approach to lemmatization and POS-tagging based on heuristics and hidden markov models of German morphology.

For a explanation of the underlying ideas see:

Christian Wartena (2019). A Probabilistic Morphology Model for German Lemmatization. In: Proceedings of the 15th Conference on Natural Language Processing (KONVENS 2019): Long Papers. Pp. 40-49, Erlangen.

https://corpora.linguistik.uni-erlangen.de/data/konvens/proceedings/papers/KONVENS2019_paper_10.pdf https://doi.org/10.25968/opus-1527

Please cite this paper if you use the software in your project.

About

The Hanover Tagger - A simple approach to lemmatization and POS-tagging of German morphology based on heuristics and hidden markov models

License:GNU Lesser General Public License v3.0


Languages

Language:Python 83.1%Language:Jupyter Notebook 16.9%