Harry's repositories
mini-lakehouse
Data lakehouse at home with docker compose
airflow-provider-great-expectations
Great Expectations Airflow operator
leetcode-practice
Collection of LeetCode questions to ace the coding interview! - Created using [LeetHub](https://github.com/QasimWani/LeetHub)
charts
The User-Community Airflow Helm Chart is the standard way to deploy Apache Airflow on Kubernetes with Helm. Originally created in 2017, it has since helped thousands of companies create production-ready deployments of Airflow on Kubernetes.
dag-factory
Dynamically generate Apache Airflow DAGs from YAML configuration files
data-on-eks
DoEKS is a tool to build, deploy and scale Data Platforms on Amazon EKS
dbt-spark
dbt-spark contains all of the code enabling dbt to work with Apache Spark and Databricks
fastapi
FastAPI framework, high performance, easy to learn, fast to code, ready for production
featureform
The Virtual Feature Store. Turn your existing data infrastructure into a feature store.
featuretools
An open source python library for automated feature engineering
great_expectations
Always know what to expect from your data.
helm-charts
Community Helm Charts
hudi
Upserts, Deletes And Incremental Processing on Big Data.
kyuubi
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
mlflow-docker
Mlflow Docker Image
seldon-core
An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models
sudohainguyen.github.io
✨ Build a beautiful and simple website in literally minutes. Demo at https://beautifuljekyll.com
trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
ydata-profiling
Create HTML profiling reports from pandas DataFrame objects
yunikorn-k8shim
Apache YuniKorn K8shim