Nick Pentreath's repositories
elasticsearch-vector-scoring
Score documents with pure dot product / cosine similarity with ES
scalanlp-core
ScalaNLP's core library. Provides useful routines for natural language processing (NLP) and machine learning.
sparklingpandas
Pandas On PySpark(POPS)
sse17-meetup
Boston ML Meetup - Spark Summit East 2017
mxnet-the-straight-dope
An interactive book on deep learning, in concept and in MXNet
pyspark-converter-examples
Writing Hadoop input/output data converters for PySpark
spark-jobserver
REST job server for Spark
spark-libFM
An implement of Factorization Machines (LibFM)
spark-vl-bfgs
Vector-free L-BFGS implementation on Spark
sparklingml
Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)
beijing-meetup-2016
Jupyter Notebooks for Apache Spark ML Meetup - Beijing Nov 2016
elasticsearch-hadoop
Elasticsearch real-time search and analytics natively integrated with Hadoop
google-java-format
Reformats Java source code to comply with Google Java Style.
incubator-spark
Mirror of Apache Spark
incubator-systemml
Mirror of Apache SystemML (Incubating)
spark-perf
Performance tests for Spark
staged-recipes
A place to submit conda recipes before they become fully fledged conda-forge feedstocks