Soumil Nitin Shah (soumilshah1995)

soumilshah1995

Geek Repo

Company:Lead Data Engineer | AWS & Apache Hudi Expert | Spark & AWS Glue Enthusiast | YouTuber

Location:New York

Home Page:https://soumilshah.com/

Github PK Tool:Github PK Tool

Soumil Nitin Shah's repositories

LakeBoost

LakeBoost

Language:PythonLicense:Apache-2.0Stargazers:6Issues:2Issues:0

StarRocks-Hudi-Minio

StarRocks+Hudi+Minio

Language:PythonLicense:GPL-3.0Stargazers:5Issues:0Issues:0

hudi-minio-starrpcks-superset

hudi-minio-starrpcks-superset

Language:PythonLicense:Apache-2.0Stargazers:4Issues:0Issues:0

apache-hudi-delta-streamer-labs

apache hudi delta streamer labs

Language:PythonLicense:GPL-3.0Stargazers:3Issues:0Issues:0

Datalake-to-Microservices-Apache-Hudi-FastAPI-Spark

From Datalake to Microservices: Unleashing the Power of Apache Hudi's Record Level Index with FastAPI and Spark Connect

Language:PythonLicense:BSL-1.0Stargazers:1Issues:0Issues:0

Dynamic-Hudi-Postgres-Ingestion

Dynamic Hudi Delta Streamer Jobs with JDBC Puller for PostgreSQL Tables, Bringing All Tables into Hudi and Running Jobs in Parallel

Language:PythonLicense:GPL-3.0Stargazers:1Issues:0Issues:0

HUDI-Spark-DBT-Glue-Hive-Metastore-Run-Locally-

HUDI + Spark+ DBT + Glue Hive Metastore Run Locally

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:1Issues:0Issues:0

Apache-Hudi-Table-Services-Hands-on-labs

pache Hudi Table Services | Hands on labs

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

aws-hudi-delta-iceberg-interoperability

aws-hudi-delta-iceberg-interoperability

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

Get-Started-with-Hudi-CLI-Locally-Using-Docker-in-Minutes-and-Connect-to-Your-S3-Data-

Get Started with Hudi CLI Locally Using Docker in Minutes and Connect to Your S3 Data

License:Apache-2.0Stargazers:0Issues:1Issues:0

glue-dot-interactive-session-template

glue-dot-interactive-session-template

License:BSD-2-ClauseStargazers:0Issues:0Issues:0

hudi-and-glue-locally

Apache Hudi and AWS Glue docker compose demo

Language:Jupyter NotebookStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

Learn-How-to-Integerate-Hudi-Spark-job-with-Airflow-and-MinIO

Learn How to Integerate Hudi Spark job with Airflow and MinIO

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

one-table-with-deltastreamer

one table-with-deltastreamer

Language:PythonLicense:BSD-2-ClauseStargazers:0Issues:0Issues:0

onetable-delta-multimodal-index-builder

onetable-delta-multimodal-index-builder

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

onetable-deltastreamer-glue

onetable-deltastreamer-glue

License:CC0-1.0Stargazers:0Issues:0Issues:0

openhouse

Open Control Plane for Tables in Data Lakehouse

License:BSD-2-ClauseStargazers:0Issues:0Issues:0

ruff

An extremely fast Python linter and code formatter, written in Rust.

License:MITStargazers:0Issues:0Issues:0

Simplified-Delta-Streamer-Job-Management-A-Structured-Approach-for-Efficient-Data-Processing

Simplified Delta Streamer Job Management: A Structured Approach for Efficient Data Processing

License:Apache-2.0Stargazers:0Issues:0Issues:0

Simplifying-Big-Data-Setting-Up-Spark-SQL-Hive-Thrift-Server-and-Hudi-with-Beeline-in-Minutes-

Simplifying Big Data: Setting Up Spark SQL, Hive Thrift Server, and Hudi with Beeline in Minutes

License:GPL-3.0Stargazers:0Issues:0Issues:0

sling-cli

Sling is a CLI tool that extracts data from a source storage/database and loads it in a target storage/database.

License:GPL-3.0Stargazers:0Issues:0Issues:0

sling-etl-cli-demo

sling-etl-cli-demo

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

sling-to-starrocks-demo

sling-to-starrocks-demo

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

sqlglot

Python SQL Parser and Transpiler

License:MITStargazers:0Issues:0Issues:0

vectordb

A minimal Python package for storing and retrieving text using chunking, embeddings, and vector search.

License:MITStargazers:0Issues:0Issues:0

xtable-with-emr-serverless

stable-with-emr-serverless

License:GPL-3.0Stargazers:0Issues:0Issues:0