dumoulma / fic-caissepop

Library of Pig UDF functions useful for tokenizing, computing TF-IDF or BNS values and vectorizing kdd1999 corpus.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

fic-caissepop

Library of Pig UDF functions useful for tokenizing, computing TF-IDF or BNS values and vectorizing kdd1999 corpus.

About

Library of Pig UDF functions useful for tokenizing, computing TF-IDF or BNS values and vectorizing kdd1999 corpus.


Languages

Language:Java 100.0%