smartfeature / nlp-doc-dataset

NLP document classification datasets



Three small datasets:

places vs orgs

people vs orgs

places vs people

letter_dict is a dictionary of the selected words across all documents.

dict_int is the mapping between the keys stored in the text data files and the words in letter_dict.
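
As a rough illustration of how the two structures might be used together, the sketch below decodes one encoded document back into words. The README does not describe the file formats, so everything here is an assumption: that dict_int maps integer keys to words, that each document is a line of whitespace-separated keys, and that the helper names decode_document and parse_line (both hypothetical) fit the data. The toy values are invented for the example and are not taken from the datasets.

```python
# Toy stand-ins only. The real contents and file formats of letter_dict and
# dict_int are not documented here, so these shapes are assumptions.
letter_dict = {"paris", "nato", "einstein"}        # assumed: the selected vocabulary
dict_int = {0: "paris", 1: "nato", 2: "einstein"}  # assumed: integer key -> word


def parse_line(line):
    """Assumed format: one document per line, keys separated by whitespace."""
    return [int(token) for token in line.split()]


def decode_document(keys, key_to_word):
    """Translate stored keys back into words, skipping keys with no entry."""
    return [key_to_word[k] for k in keys if k in key_to_word]


encoded_line = "2 0 0 1"  # hypothetical encoded document
words = decode_document(parse_line(encoded_line), dict_int)
print(words)  # ['einstein', 'paris', 'paris', 'nato']
```

If the mapping in the actual files runs the other way (word to key), inverting it first with a dict comprehension would give the same lookup.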

