marthawh's repositories
spark-py-notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
datashader
Turns even the largest data into images, accurately
criteo-1tb-benchmark
Benchmark of different ML algorithms on Criteo 1TB dataset
pyspark-tutorial
PySpark-Tutorial provides basic algorithms using PySpark
hello_world
test GitHub knowledge
Data-Analysis-and-Machine-Learning-Projects
Repository of teaching materials, code, and data for my data analysis and machine learning projects.
scikit-learn-videos
Jupyter notebooks from the scikit-learn video series
GraphLab-Create-SDK
SDK for Turi's GraphLab Create.
python-libffm
A Python wrapper for the libffm library.
Python-Machine-Learning-Cookbook
Code files for Python-Machine-Learning-Cookbook
DAT8
General Assembly's Data Science course in Washington, DC
DAT4
General Assembly's Data Science course in Washington, DC
document_cluster
A guide to document clustering in Python
pyspark-examples
Example Apache Spark and Python Scripts