Use lemmatization and parts of speech tagging for token matching
rafelafrance opened this issue · comments
This will require separate regex search strategies for tokeninzation and parsing as well as using token byte strings for the searching. We're going to be slinging bytes in and out of the token matches so this willl become compute intensive. Cython?
Done.