apache / tika

The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).

Home Page: https://tika.apache.org/

apache/tika Stargazers