Soumil Nitin Shah's repositories
install-external-python-packages-on-serverless
install external python packages on serverless
PythonLambdaDockerECR
PythonLambdaDockerECR
fastapi-python
Learn How to make and deploy Fast api in python using docker
run-aws-glue-locally-docker
run-aws-glue-locally-docker
Unlocking-Incremental-Data-in-PySpark-Extracting-from-JDBC-Sources-without-Debezium-or-AWS-DMS-with
Unlocking Incremental Data in PySpark: Extracting from JDBC Sources without Debezium or AWS DMS with CDC
Efficient-Data-Ingestion-with-Glue-Concurrency-Using-a-Single-Template-for-Multiple-S3-Tables-into-
Efficient Data Ingestion with Glue Concurrency: Using a Single Template for Multiple S3 Tables into a Transactional Hudi Data Lake
Project-Using-Apache-Hudi-Deltastreamer-and-AWS-DMS-Hands-on-Lab
Project : Using Apache Hudi Deltastreamer and AWS DMS Hands on Labs
Power-your-Down-Stream-Elastic-Search-Stack-From-Apache-Hudi-Transaction-Datalake-with-CDC
Power your Down Stream Elastic Search Stack From Apache Hudi Transaction Datalake with CDC
source-to-target-mapping-python
source to target mapping python
-Architecture-Powering-Down-Stream-System-with-CDC-from-HUDI-Transactional-Datalake-
Architecture Powering Down Stream System with CDC from HUDI Transactional Datalake
aws-glue-studio-and-clickhous-etl-job
AWS Glue Studio and ClickHouse Integration
emr-serverless-labs-cli
emr-serverless-labs-cli
How-do-I-read-data-from-Cross-Account-S3-Buckets-and-Build-Hudi-Transactional-Datalake-in-Central-AW
How do I read data from Cross Account S3 Buckets and Build Hudi Transactional Datalake in Central AWS Account
Advantages-of-Metadata-Indexing-and-Asynchronous-Indexing-in-Hudi-Hands-on-Lab
Advantages of Metadata Indexing and Asynchronous Indexing in Hudi Hands on Lab
amazon-emr-cli
A command-line interface for packaging, deploying, and running your EMR Serverless Spark jobs
Bootstrapping-in-Apache-Hudi-on-EMR-Serverless
Bootstrapping in Apache Hudi on EMR Serverless
Change-Data-Capture-in-Apache-Hudi-
Change Data Capture in Apache Hudi hands on lab
ci-cd-serverless-spark-1
Sample CI/CD pipeline for using GitHub Actions with Amazon EMR Serverless Spark.
Clustering-in-Hudi-hands-on-Labs
Clustering in Hudi hands on Labs
Efficient-Data-Lake-Management-with-Apache-Hudi-Cleaner-Benefits-of-Scheduling-Data-Cleaning
Efficient Data Lake Management with Apache Hudi Cleaner: Benefits of Scheduling Data Cleaning
How-to-Query-Hudi-Tables-in-Incremnetal-Fashion-and-Get-only-New-data-on-AWS-GLue-
How to Query Hudi Tables in Incremnetal Fashion and Get only New data on AWS GLue
Incremental-Processing-Pipeline-to-power-Aurora-Postgres-SQL-from-Hudi-Transcational-Datalake-
Incremental Processing Pipeline to power Aurora Postgres SQL from Hudi Transcational Datalake
Learn-about-Apache-Hudi-Transformers-with-Hands-on-Lab
Learn about Apache Hudi Transformers with Hands on Lab
Learn-How-to-Interrelate-Apache-Hudi-with-Redshift-Spectrum-Hands-on-Labs
Learn How to Interrelate Apache Hudi with Redshift Spectrum Hands on Labs
Lets-Build-CDC-Pipeline-from-Microsoft-SQL-Server-into-Apache-Hudi-Transactional-Datalake-
Lets Build CDC Pipeline from Microsoft SQL Server into Apache Hudi Transactional Datalake
Running-Apache-Hudi-Delta-Streamer-On-EMR-Serverless-Hands-on-Lab-step-by-step-guide-for-beginners
Running Apache Hudi Delta Streamer On EMR Serverless Hands on Lab step by step guide for beginners
Step-by-Step-Guide-to-Incrementally-Pulling-Data-from-JDBC-with-Python-and-PySpark
Step-by-Step Guide to Incrementally Pulling Data from JDBC with Python and PySpark