Checks the performance of reading PDF files, including:
- getting the number of pages using Python libraries.
- ... some day ...
Current stable version: v1.0
Release date: 26.03.2019
Maciej Januszewski (maciek@mjanuszewski.pl)
- First, start the Apache Tika server (required for the Tika-based check):
docker pull logicalspark/docker-tikaserver
docker run -d -p 9998:9998 logicalspark/docker-tikaserver
- Then run the script, passing the directory that contains the PDF files:
./run.py <path/to/pdfs_data/>
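
As a rough illustration of the kind of measurement described above, here is a minimal sketch of a timing harness. It is not the code in run.py. The names `benchmark` and `count_pages_naive` are hypothetical, and the naive counter is a stdlib-only stand-in that assumes page objects are stored uncompressed, so it will miscount many real PDFs. A real library-backed counter can be passed in its place.

```python
import re
import time
from pathlib import Path
from typing import Callable, Dict, Tuple

# Matches "/Type /Page" but not "/Type /Pages" (the page-tree node).
_PAGE_RE = re.compile(rb"/Type\s*/Page(?![a-zA-Z])")


def count_pages_naive(pdf_path: Path) -> int:
    """Crude page counter (assumption: page objects are not compressed)."""
    return len(_PAGE_RE.findall(pdf_path.read_bytes()))


def benchmark(
    pdf_dir: Path, counter: Callable[[Path], int]
) -> Tuple[Dict[str, int], float]:
    """Run `counter` on every *.pdf file directly inside `pdf_dir`.

    Returns a mapping of file name to page count, and the total elapsed
    wall-clock time in seconds.
    """
    counts: Dict[str, int] = {}
    start = time.perf_counter()
    for pdf_path in sorted(pdf_dir.glob("*.pdf")):
        counts[pdf_path.name] = counter(pdf_path)
    elapsed = time.perf_counter() - start
    return counts, elapsed
```

Calling `benchmark(Path("path/to/pdfs_data"), count_pages_naive)` returns the per-file page counts together with the elapsed time. Keeping the counter as a plain callable means the same loop can time several libraries on the same set of files.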