nyanshell / GEKIKO

An Old Chinese NLP Project

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

GEKIKO (擊鼓)

Scripts for Old Chinese NLP work.

USAGE

N-grams

    python3 n_gram.py <input_file_or_directory>

Linguistic rules based POS tagging

    python3 pattern_predict.py <text_file> <tagged_word_list>

Count Patterns

    python3 pattern_count.py <text_file> <tagged_word_list>

WORD TAG FORMAT

Not implemented yet.

About

An Old Chinese NLP Project


Languages

Language:Python 100.0%