A basic E-PDF parser that extracts all the Text Properties. Those include the Text, Text Font, Text Style, Text Size, Text Color. The parser performs also performs Data pre-processing by removing stopwords and punctuation.
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool