soumilshah1995 / dynamodb-hudi-stream-project

dynamodb-hudi-stream-project

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Picture1

Project Overview

  • Users in this architecture purchase things from online retailers and generate an order transaction that is kept in DynamoDB. The raw data layer stores the order transaction data that is fed into the data lake. To accomplish this, enable Kinesis Data Streams for DynamoDB, and we will stream real-time transactions from DynamoDB into kinesis data streams, process the streaming data with lambda, and insert the data into the next kinesis stream, where a glue streaming job will process and insert the data into Apache Hudi Transaction data lake, and build dashboards and derive insights using QuickSight.

Video Tutorial

Code

Step by Step guide with Instruction and Screenshots

Steps

Step 1:

  • Users in this architecture purchase things from online retailers and generate an order transaction that is kept in DynamoDB.

Step 2:

  • The raw data layer stores the order transaction data that is fed into the data lake. To accomplish this, enable Kinesis Data Streams for DynamoDB, and we will stream real-time transactions from DynamoDB into kinesis data streams, process the streaming data with lambda, and insert the data into the next kinesis stream, where a glue streaming job will process and insert the data into Apache Hudi Transaction data lake.

Step 3:

  • Users can build dashboards and derive insights using QuickSight.

About

dynamodb-hudi-stream-project

License:Apache License 2.0


Languages

Language:Python 92.0%Language:Shell 8.0%