UWNETLAB / Nate

Research at the intersection of natural language processing and social network analysis.

Home Page:http://networkslab.org/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Implement checks for preprocessing (and run if missing) for each pipeline

pbrowne88 opened this issue · comments

Currently, it's necessary to run nate.preprocess to get the spaCy data necessary to instantiate a nate pipeline.

First, pipelines should elegantly check to see if the necessary preprocessing has been completed. This should be simple and is a logical endpoint.

For further user friendliness, though, it would be prudent to enable each of the pipeline-returning functions to also run preprocessing using defaults that will configure the preprocess function to meet their requirements.

Low priority.