Scott Haines's repositories
spark-moderndataengineering
The source code for the book Modern Data Engineering with Apache Spark
hitchhikers_guide_to_deltalake_streaming
Don't Panic. This guide will help you when it feels like the end of the world.
odsc-west-streaming-trends
All Data, Relevant Information, Scripts, and Applications for the Open Data Science Conference (2018)
spark-intro-to-ml
A Gentle introduction to Machine Learning with Apache Spark
spark-summit-2018
Spark Application : Spark Summit 2018 : Streaming Trend Discovery
odsc-east-2020-decision-intelligence
This is the home of the 2020 Open Data Science Conference workshop (Creating Streaming Predictive Analytics and Decision Intelligence Systems with Apache Spark)
odsc-east-realish-predictions
Material for the 2019 ODSC East Workshop (Realish Time Predictive Analytics with Spark Structured Streaming)
spark-inception
This project is available free of charge as a companion to my Data+AI Summit (2022) talk.
svcc-2019-realish-spark
This is the material for the 2019 Silicon Valley Code Camp Session "Realish Time Predictive Analytics with Spark Structured Streaming"
dailyhacking
source files
docker-spark-base
Creates a customizable base image for working with Apache Spark
odsc-east2019-warmup
Warmup Presentation for The 2019 Open Data Science Conference in Boston
datariders
Meetup Presentations
delta-docs
Delta Lake Documentation
elasticsearch-hadoop
:elephant: Elasticsearch real-time search and analytics natively integrated with Hadoop
gameengine
Here is a take on building a game engine with html5
learn-spark-elasticsearch
This is a Docker environment for running ElasticSearch, Kibana, Spark and Zeppelin
odsc-west-2019-realtime-analytics
Workshop Material for Near RealTime Predictive Analytics with Apache Spark Structured Streaming Workshop at the Open Data Science Conference WEST 2019
parallel.js
Parallel.js is a tiny library for multi-core processing in Javascript.
spark-expectations
A Python Library to support running data quality rules while the spark job is running⚡
unitycatalog
Open, Multi-modal Catalog for Data & AI
webworkers-js
Javascript Webworkers Playground