scienceguyrob / ABigDataPipelineForHighVolumeScientificDataStreams-SupportingPythonSource

A Big Data Pipeline for High Volume Scientific Data Streams - Supporting Python Source

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Supporting Material: A Big Data Pipeline for High Volume Scientific Data Streams (version 1.0)

This notebook provides python source code for the examples presented in the recent paper, "A Big Data Pipeline for High Volume Scientific Data Streams".

DOI

Author: Rob Lyon

Overview

This repository consists of,

  • an ipython notebook that explores and describes the code discussed in the paper.
  • a pdf version of the notebook for those without Jupyter notebooks (formatting may be poor).

Note the notebook contains interactive Python 2.7 code. It requires Numpy, Scipy, and Scikit-learn.

We kindly request that if you make use of the notebook, please cite the work using the repository DOI.

License

The code and the contents of this notebook are released under the GNU GENERAL PUBLIC LICENSE, Version 3, 29 June 2007. We kindly request that if you make use of the notebook, you cite the work appropriately.

Acknowledgements

I gratefully acknowledge the co-authors of the paper for their efforts.

Change log

First version.

About

A Big Data Pipeline for High Volume Scientific Data Streams - Supporting Python Source


Languages

Language:Jupyter Notebook 100.0%