chandnipatelTW's repositories
wonderland-fsharp-katas
self-contained F# script, and a companion instructions file for katas
tw-twdu-2a
Streaming Data Pipeline Code from TwoWheelers for TWDU-2a
advanced-transformation
This is katas for week 2 comprising of:
transformations-with-pyspark
Repository with transformations with pyspark.
infra-twdu2b
Basic infrastructure for TWDU 2 Team B
infra-twdu2a
Infrastructure set up for TWDU2 group A
infra-gcube
Infrastructure repo for GCube
streaming-pipeline
Golden copy of the streaming data pipeline used for the TwoWheelers client simulation session in the data engineering training program.
basic-aws-infrastructure
Golden Copy to help you set up a basic AWS environment with an EMR cluster behind a VPC.
transformations
Katas for transforming data with Spark + Scala.
Airflow-Example-DAGS
DAGS used to automate citibike and wordcount apps
crime-data-transformations
A repository that analyzes crime data using Spark + Scala
airflow_examples
Repo of airflow examples people have made to automate the word count or citibike examples locally and in AWS.
spring-security-saml
SAML extension for the Spring Security project
data-transformations
Started code base for Spark + Scala project.
kube-airflow
A docker image and kubernetes config files to run Airflow on Kubernetes
streaming-data-pipeline
Streaming pipeline repo for data engineering training program
python-getting-started
Getting Started with Python on Heroku.
docker-curriculum
:dolphin: A comprehensive tutorial on getting started with Docker!
join-transformations
This repository will walk you through several katas for learning how to do joins with Spark+Scala.
basic-transformations
This repository has been built to help people learn how to do basic transformations on a single DataFrame in Spark + Scala.
semi-structured-data-transformations
This repository has been built to help people learn how to work with semi-structured data sources with Spark+Scala.
ReduxSimpleStarter
Starter pack for an awesome Udemy course
specifying-schema
Show different ways to specify schema with Spark + Scala, and potential issues that can occur.
essential-data-developer
Course outline, learning resources and some code for "essential data developer" course