adobe / NLP-Cube

Natural Language Processing Pipeline - Sentence Splitting, Tokenization, Lemmatization, Part-of-speech Tagging and Dependency Parsing

Home Page: http://opensource.adobe.com/NLP-Cube/index.html

Full training and model import example

dumitrescustefan opened this issue

We need to provide a Jupyter notebook example and the associated script that will allow users to:

  1. download the UD corpus
  2. train a full model
  3. package the full model in a zip file
  4. import the model in their local model storage folder
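Steps 3 and 4 can be sketched with the Python standard library alone. This is only an illustration of the intended flow, not NLP-Cube's actual API: the function names package_model and import_model, the storage folder layout, and the model name are all hypothetical, and the download and training steps are left out because they depend on the project's own tooling.

```python
import tempfile
import zipfile
from pathlib import Path


def package_model(model_dir, zip_path):
    """Zip every file under model_dir, storing paths relative to model_dir.

    Hypothetical helper: zip_path should live outside model_dir so the
    archive does not try to include itself.
    """
    model_dir = Path(model_dir)
    zip_path = Path(zip_path)
    zip_path.parent.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path, "w", zipfile.ZIP_DEFLATED) as archive:
        for path in sorted(model_dir.rglob("*")):
            if path.is_file():
                archive.write(path, path.relative_to(model_dir).as_posix())
    return zip_path


def import_model(zip_path, storage_dir, name):
    """Extract a packaged model into storage_dir/name and return that folder.

    Hypothetical helper: the real local model storage layout may differ.
    """
    target = Path(storage_dir) / name
    target.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(target)
    return target


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        trained = root / "trained_model"
        trained.mkdir()
        # Stand-in for the files a training run would produce.
        (trained / "tagger.bestUPOS").write_bytes(b"weights")
        archive = package_model(trained, root / "packages" / "my_model.zip")
        installed = import_model(archive, root / "model_storage", "my_model")
        print(sorted(p.name for p in installed.iterdir()))
```

Storing paths relative to the model folder keeps the archive portable, so the same zip file unpacks correctly whatever the storage folder is on the target machine.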

Please update Advanced usage - Training a model on the UD Corpus.ipynb to mention the wiki.dummy.vec file. It is useful when you want to train smaller models. It is also important to use the same dummy word embeddings for all the smaller models. For this reason, users should download this file once and pass it as input to every training run, so that we avoid download conflicts.
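One way to confirm that every smaller model was trained with the same embeddings file is to compare checksums of the files that were actually used. The sketch below is an assumption about how such a check could look and is not part of NLP-Cube: the helper names file_sha256 and same_embeddings are hypothetical.

```python
import hashlib


def file_sha256(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def same_embeddings(paths):
    """Return True when every listed file has byte-identical content."""
    return len({file_sha256(path) for path in paths}) <= 1
```

Calling same_embeddings with the paths of the wiki.dummy.vec copies used by each training run returns False as soon as one copy differs from the others.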