nikitakit / self-attentive-parser

High-accuracy NLP parser with models for 11 languages.

Home Page:https://parser.kitaev.io/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Avoid downloading of nltk 'punkt' tokenizer

duichwer opened this issue · comments

Shouldn't the parameter preserve_line=True being added to the call of nltk.word_tokenize since there should be only one sentence everytime?

sentence = nltk.word_tokenize(sentence, self._tokenizer_lang)