A tiny wrapper parses text from document files like pdf
, docx
etc. and passes it to GNFinder for extracting names
go install
go build
pre-built binaries by GitHub Actions can be downloaded from releases
# below command starts gnfinder-plus on 3006 port
# different port can be given by -port=3006 etc.
./gnfinder-plus
# in another window
curl http://localhost:3006/parse?file=url
wget https://indiabiodiversity.org/biodiv/content/documents/document-0162468a-7ce7-499e-ac6d-ead2dc273c35/687.pdf
./gnfinder-plus -file=687.pdf
in some cases pdftotext
binary might be missing please install according to your os
sudo apt install poppler-utils # debian