Soumil Nitin Shah (soumilshah1995)

soumilshah1995

Geek Repo

Company:Lead Data Engineer | AWS & Apache Hudi Expert | Spark & AWS Glue Enthusiast | YouTuber

Location:New York

Home Page:https://soumilshah.com/

Github PK Tool:Github PK Tool

Soumil Nitin Shah's repositories

hudi-kafka-learn

hudi-kafka-learn

Language:PythonLicense:Apache-2.0Stargazers:4Issues:0Issues:0

Learn-How-to-take-Regular-Save-Points-for-Backup-purposes-for-your-Hudi-tables-with-Glue-4.0

Learn How to take Regular Save Points for Backup purposes for your Hudi tables with Glue 4.0

Language:PythonLicense:Apache-2.0Stargazers:4Issues:2Issues:0

Automating-EMR-Serverless-Workload-Creating-Submitting-Destroying-EMR-Cluster-using-Step-Funct

Automating EMR Serverless Workload | Creating| Submitting | Destroying EMR Cluster using Step Function

Language:PythonLicense:Apache-2.0Stargazers:3Issues:2Issues:0

hudi-labs

hudi-labs

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3Issues:0Issues:0

kafka-debezium-python

kafka-debezium-python

Language:PythonLicense:Apache-2.0Stargazers:3Issues:0Issues:0

Project-Using-Apache-Hudi-Deltastreamer-and-AWS-DMS-Hands-on-Lab

Project : Using Apache Hudi Deltastreamer and AWS DMS Hands on Labs

License:Apache-2.0Stargazers:3Issues:3Issues:0

aws-dms-kinesis-flink-hudi

aws-dms-kinesis-flink-hudi

License:Apache-2.0Stargazers:2Issues:0Issues:0

Develop-Realtime-Streaming-Ingestion-from-MongoDB-Atlas-into-Hudi-Datalake-with-Glue-kinesis-and-ev

Develop Realtime Streaming Ingestion from MongoDB Atlas into Hudi Datalake with Glue, kinesis and eventbridge

Language:PythonLicense:Apache-2.0Stargazers:2Issues:2Issues:0

dynamodb-flink-kinesis

Real Time, Low latency Stream Pipeline with HUDI, Flink and Kinesis

Language:PythonLicense:Apache-2.0Stargazers:2Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:2Issues:0Issues:0

kinesis-flink-labs

Learn About Apache hudi + Flink and Kinesis

License:Apache-2.0Stargazers:2Issues:0Issues:0

Power-your-Down-Stream-Elastic-Search-Stack-From-Apache-Hudi-Transaction-Datalake-with-CDC

Power your Down Stream Elastic Search Stack From Apache Hudi Transaction Datalake with CDC

Language:PythonLicense:Apache-2.0Stargazers:2Issues:0Issues:0

source-to-target-mapping-python

source to target mapping python

Language:PythonLicense:Apache-2.0Stargazers:2Issues:0Issues:0

-Architecture-Powering-Down-Stream-System-with-CDC-from-HUDI-Transactional-Datalake-

Architecture Powering Down Stream System with CDC from HUDI Transactional Datalake

Language:PythonLicense:Apache-2.0Stargazers:1Issues:2Issues:0

Async-callback-Pattern-to-Automate-orchestrating-EMR-Serverless-Jobs-with-Step-Functions-

Async callback Pattern to Automate orchestrating EMR Serverless Jobs with Step Functions

Language:PythonLicense:Apache-2.0Stargazers:1Issues:2Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1Issues:2Issues:0

How-do-I-Ingest-Small-Files-into-Hudi-Datallake-with-Glue-Incremental-data-processing-

How do I Ingest Small Files into Hudi Datallake with Glue Incremental data processing

Language:PythonLicense:Apache-2.0Stargazers:1Issues:2Issues:0

How-do-I-read-data-from-Cross-Account-S3-Buckets-and-Build-Hudi-Transactional-Datalake-in-Central-AW

How do I read data from Cross Account S3 Buckets and Build Hudi Transactional Datalake in Central AWS Account

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1Issues:0Issues:0

hudi

Upserts, Deletes And Incremental Processing on Big Data.

License:Apache-2.0Stargazers:1Issues:0Issues:0

hudi-with-dbt-demo

hudi-with dbt-demo

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1Issues:0Issues:0

apache-hudi-lake-formation

apache-hudi-lake-formation

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Change-Data-Capture-in-Apache-Hudi-

Change Data Capture in Apache Hudi hands on lab

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:2Issues:0

flink-cdc-connectors

CDC Connectors for Apache Flink®

License:Apache-2.0Stargazers:0Issues:0Issues:0

getting-started-with-emr-serverless-and-hudi

getting started with emr serverless and hudi

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

How-do-I-Mask-PII-data-in-Apache-Hudi-Datalake-Hands-on-Labs

How do I Mask PII data in Apache Hudi Datalake | Hands on Labs

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:2Issues:0

How-to-Build-Production-Ready-Alternative-Data-Pipeline-from-DynamoDB-to-Apache-Hudi-with-Streams-an

How to Build Production Ready Alternative Data Pipeline from DynamoDB to Apache Hudi with Streams and Firehose and Glue with Infrasture Code

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Incremental-Processing-Pipeline-to-power-Aurora-Postgres-SQL-from-Hudi-Transcational-Datalake-

Incremental Processing Pipeline to power Aurora Postgres SQL from Hudi Transcational Datalake

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Lets-Build-CDC-Pipeline-from-Microsoft-SQL-Server-into-Apache-Hudi-Transactional-Datalake-

Lets Build CDC Pipeline from Microsoft SQL Server into Apache Hudi Transactional Datalake

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

Real-Time-Streaming-Ingestion-Pipeline-Ingest-EventBridge-Events-into-Hudi-Datalake-with-Kinesis-and

Real Time Streaming Ingestion Pipeline Ingest EventBridge Events into Hudi Datalake with Kinesis and Glue

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0