Juliet Hougland's repositories
ds-for-telco
Source material for the Data Science for Telecom tutorial at Strata Singapore 2015
sparklingpandas-ex
Examples of using SparklingPandas and Pandas with PySpark
py-hadoop-tutorial
Source material for using Python and Hadoop together
ttitd-traffic
That thing in the desert has traffic. What is it like?
svd-benchmark
A repo for benchmarking distributed implementations of the singular value decomposition.
mllib-utils
Some wrapper utilities for working with Spark MLlib.
sk-score-ex
Example of applying a fitted sklearn model to a distributed dataset using PySpark.
sparklingpandas
Pandas on PySpark (POPS)
whattreeisthis
What tree is this? A progressive web app that teaches users how to identify trees.
compare-a-frame
Serde Comparisons for Pandas DataFrames
gen-lin-models
An IPython notebook explaining generalized linear models, particularly for count data.
jhlch.github.io
A place to write.
parksconserverancy
Data project for Conservancy Vegetation Monitoring Data
parquet-mr
Mirror of Apache Parquet
py-env-parcel
Scripts for building CDH parcels to distribute Python environments.
ranger-survey
Black Rock City Ranger Survey Analysis
zika-hackathon
Data Science Hackathon with UT Austin | Mosquito Transmitted Viruses