Soumil Nitin Shah's repositories
code-snippets
code-snippets
DebeziumFlinkHudiSync
Bringing Data from MySQL to Kafka Using Debezium, Joining Kafka Topics with Flink, Upserting into a New Kafka Topic, and Ingesting into Hudi Real-Time
LinkedIn-Easy-Apply-Bot
Automate the application process on LinkedIn
universal-data-lakehouse-xTable-MinIO-Trino
universal-data-lakehouse-xTable-MinIO-Trino
DeltaHudiTransformations
DeltaHudiTransformations
flink-iceberg-hive
flink-iceberg-hive
trino-kafka-demo
Hands-on demo for querying Kafka streams using SQL with Trino and data integration with PostgreSQL.
universal-datalakehouse-postgres-ingestion-deltastreamer
universal-datalakehouse-postgres-ingestion-deltastreamer
daft-hudi-examples
daft-hudi-examples
emr-serverless-airflow-deltastreamer-jobs
emr-serverless-airflow-deltastreamer-jobs
hudi-daft-lambda
hudi-daft-lambda
universal-datalakehouse-mysql-ingestion-deltastreamer
universal-datalakehouse-mysql-ingestion-deltastreamer
apache-x-table-sync-aws-cloud-shell
apache-x-table-sync-aws-cloud-shell
Daft
Distributed DataFrame for Python designed for the cloud, powered by Rust
DaftHudi
Build Analytical Applications on Data Lakehouse with Apache Hudi, Daft & Streamlit
DataLakeHouseX-Apache-XTable-MinIO-StarRocks-DeltaStreamer-Hudi-IceBerg-Delta-Interoperability-
DataLakeHouseX: Apache XTable, MinIO, StarRocks, DeltaStreamer, Hudi, IceBerg, Delta Interoperability"
DeltaStream-BroadcastJoinETL
DeltaStream-BroadcastJoinETL
DeltaStreamer-Airflow-EMR-Xtable
DeltaStreamer-Airflow-EMR-Xtable
DMS-to-S3-Single-Table-Integration
A Simple Config-Driven Python Template for Rapid DMS to S3 Integration | Single Task per Table Strategy
election-stock-analysis
election-stock-analysis
event-driven-dms-failure-alerts
event-driven-dms-failure-alerts
hudi-aws-glue-0.14
How to use Hudi 0.14 on AWS glue
hudi-datedim
hudi-datedim
Hudi-spark-sql-minio
Hudi-spark-sql-minio
hudi-streamer-pulsar
hudi-streamer-pulsar
hudi-trino-integeration-guide
hudi-trino-integeration-guide
HudiDeltaStreamer-SCD-Trino
HudiDeltaStreamer-SCD-Trino
Multiple-Spark-Writers-with-Apache-Hudi
Multiple Spark Writers with Apache Hudi
trino-k8-locally
trino-k8-locally
unitycatalog
Open, Multi-modal Catalog for Data & AI