NASA-PDS / product_relationships

R&D development effort with RDF databases attempting to better enable management and searching of PDS4 product relationships

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Overview

A command line tool to extract basic metadata and product relationships information from PDS4 label files. Extracted information is stored in RDF Turtle files.

Installation

This is a Java Maven project. Run "mvn package" to create binary zip file (target/harvest-rdf-1.0-bin.zip) Extract generated zip file to any folder (which we will call HARVEST_HOME).

Basic Operation

Create a configuration file, for example, /tmp/harvest-rdf.xml. In this example /ws/data/context/pds4/ folder contains PDS4 labels of context products.

<?xml version="1.0" encoding="UTF-8"?>

<harvest>
    <directories>
        <path>/ws/data/context/pds4/</path>
    </directories>
</harvest>

Run harvest-rdf.bat on Windows or harvest-rdf on Unix, located in HARVEST_HOME/bin folder.

harvest-rdf -c /tmp/harvest-rdf.xml

The tool will process all PDS4 labels located in /ws/data/context/pds4/ folder and its sub-folders. By default, generated RDF Turtle file is located in /tmp/harvest-rdf/data.ttl. You can change default output folder by providing "-o" parameter.

harvest-rdf.bat -c /tmp/harvest-rdf.xml -o /tmp/rdf1

To see basic usage information, run harvest-rdf.bat or harvest-rdf without any parameters.

Usage: harvest-rdf <options>

Required parameters:
  -c <file>     Configuration file
Optional parameters:
  -o <dir>      Output directory. Default is /tmp/harvest-rdf/out
  -l <file>     Log file. Default is /tmp/harvest-rdf/harvest.log
  -v <level>    Logger verbosity: Debug, Info (default), Warn, Error

About

R&D development effort with RDF databases attempting to better enable management and searching of PDS4 product relationships

License:Apache License 2.0


Languages

Language:Java 95.9%Language:Shell 2.9%Language:Batchfile 1.2%