Karen Zhang's repositories
airflow_data_pipelines
ETL pipelines built with Apache Airflow that transform data from various sources into a star schema.
Calgary_traffic_analysis
ENSF592 project that analyzes traffic information in Calgary.
data-engineering-zoomcamp
Free Data Engineering course!
data_lake_with_spark
Using Spark to build an ETL pipeline for a data lake hosted on S3.
data_warehouse
An ETL pipeline for a database hosted on Redshift.
postgres_data_modelling
A Postgres database with tables designed to optimize queries on song play analysis for a music streaming app.
UDND_capstone
Capstone project for the Udacity Data Engineering Nanodegree.
ENEL645_Project
Deep learning
Movie-Theater-Ticket-App
A movie theater ticket registration app designed for the ENSF 619 final project.
Multi-class-multi-tag-classifier-with-PySpark
An extension of the Multi-class Multi-tag Classifier System for StackOverflow Questions by Gonzalez et al., implemented in PySpark.
Toolshop_client_server
Joint project for ENSF607 and ENSF608: the ToolShop project.