mmihaltz / trendminer-hunlp

Hungarian NLP pipeline for social media text analysis (TrendMiner project)

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Trendminer Hungarian Processing Pipeline (trendminer-hunlp): a suite of scripts that perform Hungarian NLP processing steps (tokenization, pos-tagging, morphological analysis, lemmatization) by extending existing tools (huntoken, hunmorph, hunpos) to be able to deal with some of the challenges presented by the special language of social media messages, which differs from the domain of standard language (generally newswire) texts that were used to develop and train the exising tools.

Trendminer Project:

Author: Márton Miháltz mmihaltz@gmail.com

About

Hungarian NLP pipeline for social media text analysis (TrendMiner project)

License:GNU General Public License v2.0


Languages

Language:Python 100.0%