tballison / file-observatory

Single server/laptop grade file-observatory

File Observatory

This repo hosts the development backend code that supports data ingestion into an Elasticsearch index for the SafeDocs File Observatory app.

This repo contains pre-ALPHA grade code for demonstration purposes only.

Some capabilities demonstrated here have been integrated into Apache Tika. Others have been spun off into standalone projects, e.g. commoncrawl-fetcher-lite.

Attribution

The commoncrawl-fetcher module includes code that relies on GeoLite2 data created by MaxMind, available from https://www.maxmind.com.

About

License: Apache License 2.0


Languages

Java 94.7%, Dockerfile 4.7%, JavaScript 0.5%, Shell 0.1%, Python 0.0%