LeeByungwoo's starred repositories
machine-learning-systems-design
A booklet on machine learning systems design with exercises. NOT the repo for the book "Designing Machine Learning Systems"
cryptoexchange-go
API wrapper for cryptocurrency exchanges impemented in golang
build-your-own-x
Master programming by recreating your favorite technologies from scratch.
awesome-ab-testing
AB Testing 📈 related articles
awesome-apache-airflow
Curated list of resources about Apache Airflow
jupyterlite
Wasm powered Jupyter running in the browser 💡
dear-github
:incoming_envelope: An open letter to GitHub from the maintainers of open source projects
aws-glue-data-catalog-client-for-apache-hive-metastore
The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog as a central repository to store structural and operational metadata for their data. AWS Glue provides out-of-box integration with Amazon EMR that enables customers to use the AWS Glue Data Catalog as an external Hive Metastore. This is an open-source implementation of the Apache Hive Metastore client on Amazon EMR clusters that uses the AWS Glue Data Catalog as an external Hive Metastore. It serves as a reference implementation for building a Hive Metastore-compatible client that connects to the AWS Glue Data Catalog. It may be ported to other Hive Metastore-compatible platforms such as other Hadoop and Apache Spark distributions
data-engineering-interview-questions
More than 2000+ Data engineer interview questions.
metabase-athena-driver
An Amazon Athena driver for Metabase 0.32 and later
spark-on-kubernetes-docker
Spark on Kubernetes infrastructure Docker images repo
sparksql-scalapb
SparkSQL utils for ScalaPB
go-elasticsearch
The official Go client for Elasticsearch
aws-mwaa-local-runner
This repository provides a command line interface (CLI) utility that replicates an Amazon Managed Workflows for Apache Airflow (MWAA) environment locally.
smart-data-lake
Smart Automation Tool for building modern Data Lakes and Data Pipelines
singlestore-spark-connector
A connector for SingleStore and Spark