donaldh / example-applications

Example applications for use with PNDA

Example Applications

This repository contains a number of example applications that can be built and run on PNDA. Each application directory contains more detailed information.

Spark Streaming

  • Simple Scala-based Spark Streaming applications that consume data from Kafka and populate HBase and OpenTSDB.
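
To illustrate the last step of such a pipeline, here is a minimal, language-neutral sketch (written in Python for brevity, not taken from the Scala applications in this repository) of turning one decoded message into an OpenTSDB telnet-style `put` line. The JSON encoding, the field names (`metric`, `timestamp`, `value`, `tags`) and the sample record are assumptions made for illustration; the actual applications define their own message schema.

```python
import json


def to_opentsdb_put(message: bytes) -> str:
    """Format one JSON-encoded message as an OpenTSDB telnet-style put line.

    Assumption: the message is a JSON object with hypothetical fields
    'metric', 'timestamp' (epoch seconds), 'value' and 'tags' (a dict).
    """
    record = json.loads(message)
    # OpenTSDB expects at least one tag, written as space-separated key=value pairs.
    tags = " ".join(f"{key}={val}" for key, val in sorted(record["tags"].items()))
    return f"put {record['metric']} {int(record['timestamp'])} {record['value']} {tags}"


# Hypothetical sample record, standing in for a message read from Kafka.
sample = (
    b'{"metric": "if.in.octets", "timestamp": 1500000000, '
    b'"value": 42, "tags": {"host": "r1", "port": "eth0"}}'
)
line = to_opentsdb_put(sample)
print(line)
```

In a streaming job, a function like this would be applied to each record of a micro-batch before the result is written to OpenTSDB.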

Spark

  • A batch example that reads data ingested by Gobblin and produces Parquet datasets optimized for consumption by Impala.

Jupyter

  • Example of a notebook for manipulating network data.

H2O

  • Runs the H2O data science platform as an application on PNDA.

Compound Packages

  • An example of a package containing multiple application component types, in this case a Spark app and a related Jupyter notebook.

About

License: Other

Languages: Scala 41.7%, Java 24.6%, Jupyter Notebook 22.8%, Python 10.7%, Shell 0.2%