slotbite / data-engineering-workshop

Data-Engineering-Workshop

These labs are designed to be completed in sequence, and the full set of instructions is documented in this repository. Read and follow along to complete the labs. Our lab instructor will give you a high-level overview of the labs and help answer any questions. Don’t worry if you get stuck; we provide hints along the way.

For the hands-on part of this workshop, you need a laptop with internet access and a web browser (Mozilla Firefox or Google Chrome).

You are encouraged to form groups of 3-4 based on your hands-on AWS experience with big data services. At least one person in each group must have a laptop to perform the tasks.

Prepare Event Engine login

Event-Engine-login

Lab 1: Perform Data Ingestion with Data Migration Service

Lab 2: Data Transformations ETL with Glue

Lab 3: Explore the Data Lake Using SQL and a Visualization Tool

Lab 4: Data Lake Automation with Lake Formation

Lab 5: Run an AI/ML Workload Using SageMaker (Optional)

AI-ML-using-Sagemaker (architecture diagram: sagemaker-arch)

Lab 6: Ingest and Analyze Real-Time Data (Optional)

Clickstream Anomaly Detection (architecture diagram: kinesis-arch)
