Siddharth Patel's starred repositories
SparkLearning
A comprehensive Spark guide collated from multiple sources that can be referred to learn more about Spark or as an interview refresher.
awesome-pipeline
A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin
Udacity-Data-Engineering-Projects
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
SF-EvictionTracker
Tracking and measuring neighborhood and district-level eviction rates in the city of San Francisco.
Skytrax-Data-Warehouse
A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for cloud data warehouse and Metabase to serve the needs of data visualizations such as analytical dashboards.