Will Jones's repositories

data-science-ipython-notebooks

Continually updated data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe), scikit-learn, Kaggle, Spark, Hadoop MapReduce, HDFS, matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines. https://bit.ly/data-notes

Language:PythonLicense:NOASSERTIONStargazers:0Issues:3Issues:0

delta

An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:1Issues:0

docker-spark

Apache Spark Standalone & Docker

Stargazers:0Issues:2Issues:0

gostatic

This is a bare bones simple static file server, written in Go. Its about as basic as it can get. It logs all requests to STDOUT. I use this as a Python SimpleHTTPServer replacement.

Language:GoLicense:UnlicenseStargazers:0Issues:2Issues:0

kube-airflow

A docker image and kubernetes config files to run Airflow on Kubernetes

Language:MakefileLicense:Apache-2.0Stargazers:0Issues:2Issues:0

luigi

Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

sic4-list

A list of 4 digit SIC codes with descriptions for download

Stargazers:0Issues:2Issues:0