USCDataScience / parser-indexer-py

Python tools for parsing documents and building the inverted index with enriched metadata. Java version with slightly different features - https://github.com/USCDataScience/parser-indexer

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Feature to output the named entity positions in .ann output format

thammegowda opened this issue · comments

The .ann format shall be compatible with BRAT http://brat.nlplab.org/configuration.html

Also output the .txt file, which is the plain text of the documents

+1 sounds great