cympfh / nlptk

NLP tool kit

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

nlptk

NLP toolkit written in C++11

For detail, please see wiki or README file in each directory.

ngram

  • enumerate ngrams of a document
  • allowing multi-document
  • outputs pairs of frequency and n-gram

tfidf

  • calculates from tf-file

aprf

  • caluculate Acc, Prec, Rec and F1-socre
  • answer and predict files are written in SVM-light like format

TODO

micro,macro-avg

About

NLP tool kit


Languages

Language:C++ 90.6%Language:Ruby 8.9%Language:Makefile 0.5%