Soumil Nitin Shah (soumilshah1995)

Company: Lead Data Engineer | AWS & Apache Hudi Expert | Spark & AWS Glue Enthusiast | YouTuber

Location: New York

Home Page: https://soumilshah.com/

Soumil Nitin Shah's repositories

code-snippets

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 5 | Issues: 2 | Issues: 0

DebeziumFlinkHudiSync

Bringing Data from MySQL to Kafka Using Debezium, Joining Kafka Topics with Flink, Upserting into a New Kafka Topic, and Ingesting into Hudi in Real Time

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 3 | Issues: 0 | Issues: 0

LinkedIn-Easy-Apply-Bot

Automate the application process on LinkedIn

Language: Python | License: Apache-2.0 | Stargazers: 3 | Issues: 0 | Issues: 0

universal-data-lakehouse-xTable-MinIO-Trino

Language: Jupyter Notebook | License: MIT | Stargazers: 3 | Issues: 0 | Issues: 0

DeltaHudiTransformations

License: Apache-2.0 | Stargazers: 2 | Issues: 0 | Issues: 0

flink-iceberg-hive

Language: Dockerfile | License: BSL-1.0 | Stargazers: 2 | Issues: 0 | Issues: 0

trino-kafka-demo

Hands-on demo for querying Kafka streams using SQL with Trino and data integration with PostgreSQL.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2 | Issues: 0 | Issues: 0

universal-datalakehouse-postgres-ingestion-deltastreamer

Language: Jupyter Notebook | License: GPL-3.0 | Stargazers: 2 | Issues: 0 | Issues: 0

daft-hudi-examples

Language: Python | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

emr-serverless-airflow-deltastreamer-jobs

Language: Python | License: BSD-3-Clause | Stargazers: 1 | Issues: 0 | Issues: 0

hudi-daft-lambda

Language: Python | License: BSD-2-Clause | Stargazers: 1 | Issues: 0 | Issues: 0

universal-datalakehouse-mysql-ingestion-deltastreamer

Language: Jupyter Notebook | License: BSL-1.0 | Stargazers: 1 | Issues: 0 | Issues: 0

apache-x-table-sync-aws-cloud-shell

License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Daft

Distributed DataFrame for Python designed for the cloud, powered by Rust

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

DaftHudi

Build Analytical Applications on Data Lakehouse with Apache Hudi, Daft & Streamlit

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

DataLakeHouseX-Apache-XTable-MinIO-StarRocks-DeltaStreamer-Hudi-IceBerg-Delta-Interoperability-

DataLakeHouseX: Apache XTable, MinIO, StarRocks, DeltaStreamer, Hudi, Iceberg, Delta Interoperability

Language: Python | License: CC0-1.0 | Stargazers: 0 | Issues: 0 | Issues: 0

DeltaStream-BroadcastJoinETL

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

DeltaStreamer-Airflow-EMR-Xtable

License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

DMS-to-S3-Single-Table-Integration

A Simple Config-Driven Python Template for Rapid DMS to S3 Integration | Single Task per Table Strategy

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

election-stock-analysis

License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

event-driven-dms-failure-alerts

Language: Python | License: BSD-2-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

hudi-aws-glue-0.14

How to use Hudi 0.14 on AWS Glue

Stargazers: 0 | Issues: 0 | Issues: 0

hudi-datedim

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Hudi-spark-sql-minio

License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

hudi-streamer-pulsar

License: BSD-2-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

hudi-trino-integeration-guide

Language: Python | License: BSD-2-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

HudiDeltaStreamer-SCD-Trino

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Multiple-Spark-Writers-with-Apache-Hudi

Multiple Spark Writers with Apache Hudi

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

trino-k8-locally

License: BSD-2-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

unitycatalog

Open, Multi-modal Catalog for Data & AI

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0