Timothy Hunter's repositories
spark-package-cmd-tool
A command line tool for Spark packages
dask
Parallel computing with task scheduling
databricks-cli
Command Line Interface for Databricks
dist-keras
Distributed deep learning with Keras and Apache Spark.
ecosystem
Integration of TensorFlow with other open-source frameworks
genjavadoc
A compiler plugin for generating doc’able Java source from Scala source
javacpp-presets
The missing bridge between Java and native C++ libraries
mleap
MLeap: Deploy Spark Pipelines to Production
mlflow
Open source platform for the complete machine learning lifecycle
pandas-profiling
Create HTML profiling reports from pandas DataFrame objects
protoc-gen-doc
Documentation generator plugin for Google Protocol Buffers
sagacious-squeegee
sagacious-squeegee
sbt-spark-package
Sbt plugin for Spark packages
spark
Mirror of Apache Spark
spark-deep-learning-1
Deep Learning Pipelines for Apache Spark
spark-ec2
Scripts used to setup a Spark cluster on EC2
spark-pandas-1
Koala: Pandas APIs on Apache Spark
spark-perf
Performance tests for Spark
spark-sklearn-1
Scikit-learn integration package for Spark
tensorflow
Open source software library for numerical computation using data flow graphs.
tensorframes
Tensorflow wrapper for DataFrames on Apache Spark
xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Flink and DataFlow