There are 0 repository under tika-server topic.
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
Extract and Visualize location from any file
📄🚀 Unleash a powerful Document Search Engine with Apache NiFi for lightning-fast, comprehensive text indexing and search.
Text extraction from scanned pdf documents in java
Tesseract OCR wrapper for Apache Tika and/or Open Semantic ETL caching the OCR results, so Tika-Server or Open Semantic ETL has not to reprocess slow and expensive OCR on same images again
Apache Tika Server as Debian GNU/Linux and Ubuntu Linux package
Web crawler with search indexing
Our project is a testament to this need, offering a comprehensive solution that combines modern technologies and architectures to create a powerful document search engine. This engine is not just a tool but a sophisticated ecosystem designed to handle complex data processing and retrieval tasks.
Configurable Tika Server docker image. https://hub.docker.com/repository/docker/kujira/tika
Run tika server forever with health check process
Container-ized (Docker) GeoTopicParser-Enabled Apache Tika Server with Lucene Geo Gazetteer.
Application in php to test load of pdf files, using docker-compose and apache-tika.
A dockerized image of Apache Tika Server - https://tika.apache.org/
A doc searcher of the documents on the local host that is based on: Tika+OCR, ElasticSearch and Kibana