Data Minded's repositories
lighthouse
Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines and apply best practices.
tpcds-dbt-duckdb
This repository contains the tpcds queries together with the code required to run this benchmark for dbt and duckdb
conveyor-samples
Samples on how to use Conveyor.
homebrew-conveyor-formulas
Brew tap repository for Conveyor
iceberg-ingestion
Public repository containing sample code for how to improve ETL ingestion processes with Apache Iceberg
conveyor-templates
Cookiecutter templates used by Conveyor.
conveyor-dbt-demo-templates
Templates for the conveyor dbt demo
autoscaler
Autoscaling components for Kubernetes
dbt-conveyor-snowflake
The Conveyor Snowflake adapter is a thin shell around the Snowflake adapter to allow authenticating users in Conveyor IDE's with Snowflake to run DBT projects
dbt_playground
Try out dbt in a Gitpod environment in one click, with a Postgres database pre-configured
ecr-mirror
Mirror public repositories to internal ECR repos
eks-spark-benchmark
Performance optimization for Spark running on Kubernetes
git-credential-oauth
A Git credential helper that securely authenticates to GitHub, GitLab and BitBucket using OAuth.
iris
Artifacts related to a training on running stream processing pipelines
karpenter
Karpenter is a Kubernetes Node Autoscaler built for flexibility, performance, and simplicity.
karpenter-core
Karpenter is a Kubernetes Node Autoscaler built for flexibility, performance, and simplicity.
platform-quack-quack-ka-ching
The duck escapes with the credits.
terraform-aws-eks
Terraform module to create an Elastic Kubernetes (EKS) cluster and associated resources 🇺🇦