There are 2 repositories under redshift-cluster topic.
Beginner data engineering project - batch edition
Terraform module to create AWS Redshift resources 🇺🇦
This checklist aims to be an exhaustive list of all elements you should consider when using Amazon Redshift.
Use AWS Lambda to Pull E-Scooter and E-Bike Location Data, store in S3 & Redshift using Data Vault Data Model, Server to Google Data Studio Dashboard
Data Warehouse with AWS Redshift and Visualizing data using Power BI
Simple getting started 1-node redshift cluster stack
The idea is how can we prepare data to be used by Business Intelligence applications like Tableu or even Jupyternotebook! 👍 In order to help the business see an overview of the data in a diagram of what important features of the product their customers might be using. Mainly, how can we improve the performance of these OLAP and OLTP transactions? For that, we use the combination of star schema tables, we build a strategy for a distributed data system, and do grouping for all the features thanks to REDSHIFT.
Airflow orchestrated ETL (running in docker containers) that pulls batch data from an API to a local Postgres database, loads to AWS S3/Redshift provisioned by Terraform, and visualized in Quicksight.
Building and executing end-to-end ELT pipeline and driving analytics using Amazon Redshift as the data warehouse solution.
In this project, an ETL pipeline is built for a database hosted on Redshift. In this project, data from S3 is loaded to staging tables on Redshift and execute SQL statements that create the analytics tables from these staging tables.
Sparkify - Data Pipelines with Airflow - Udacity Data Engineering Expert Track.
Udacity Course, Data Engineering Nanodegree, 3rd Project, Data Warehouse with Amazon Redshift
Data Pipeline with Apache Airflow
Deploy an Amazon Redshift Cluster in AWS using Terraform
Implement ETL data pipeline that reads data from S3 bucket and loads data into AWS redshift using Airflow
A repository concentrating on using High end parallel pipelines to perform ETL across various data sources
Database Schema & ETL pipeline for Song Play Analysis | Bosch AI Talent Accelerator Scholarship Program
data pipeline ETL using Apache Airflow form data movement and amazon s3-storge with redshift cluster to storing the data in fact and DEM teables
Udacity Data Engeneering Nanodegree Program - My Submission of Project: Data Warehouse
This project builds a pipeline from AWS S3 storage to a Redshift Cloud hosted Cluster where an ETL process extracts the staged data and creates a STAR Schema to allow business users to query the data easier for business intelligence.
This project is a data warehousing solution for Sparkify, a music streaming service to extract data from JSON logs and stores it in a star schema data model in Amazon Redshift.
Udacity Data Engineering project: Data Warehouse
applying data warehouses tools and AWS to build an ETL pipeline for a database hosted on Redshift. loading data from AWS S3 bucket to staging tables on Redshift and executing SQL statements that create the analytics tables from these staging tables.
Python ETL pipeline to load data from Amazon S3 to Redshift analytics tables
Warehousing with redshift
AWS Pipeline examples from Udacity Date Engineering Nanodegree.
A music streaming startup, Sparkify, has grown their user base and song database and want to move their processes and data onto the cloud. The data resides in S3, in a directory of JSON logs on user activity on the app, as well as a directory with JSON metadata on the songs in their app. The objective of the project is to create an ETL pieline to build a datawarehouse . We extract data from S3, stage them in Redshift, and transform data into a set of dimensional tables for the analytics team to continue finding insights into what songs their users are listening to.
Udacity Data Engineering Nanodegree Project 3
Simplistic Terraform module for creating a AWS Redshift cluster to allow direct access programmatically or via a tool like DBeaver, pgAdmin, etc.
The data is collected from IMDB and then transformed before loading to warehouse
A data warehouse on Amazon Redshift using a star schema to facilitate the analysis of user behaviour on a music streaming app.
Query-able API analyzing pay gap between WNBA and NBA by scraping multiple data sources utilizing Python and Beautiful Soup