park.suhyuk's repositories
data-engineer-intermediate-training
Data Engineering Intermediate Training Course
data-engineer-advanced-training
Data Engineer Advanced Training Course
data-engineer-basic-training
Data Engineer Basic Training Course
docker-for-dummies
docker for dummies
helloworld
hello world for git test
data-engineer-druid
Apache Druid (Incubating) - Column oriented distributed data store ideal for powering interactive applications
docker-kafka
Dockerfile for Apache Kafka
psyoblade.github.io
psyoblade.github.io
delta-for-dummies
delta lake tutorial
ssm-seoul-data-engineer
SSM Seoul Data Engineer Course Books
study-for-me
Summary of what I studied in 2023
all-spark-notebook
Docker image for Apache Spark hands-on practice (AWS S3 storage)
BlackHole
BlackHole is a modern macOS virtual audio driver that allows applications to pass audio to other applications with zero additional latency.
ClickHouse
ClickHouse® is a free analytics DBMS for big data
delta-docker
Official Dockerfile for Delta Lake
delta-examples
Delta Lake examples
docker-hadoop-workbench
A Hadoop cluster based on Docker, including Hive and Spark.
docker-sqoop
Apache Sqoop docker image
docker-stacks
Ready-to-run Docker images containing Jupyter applications
github-slideshow
A robot powered training repository :robot:
hadoop
Apache Hadoop
junit5-samples
Collection of sample applications using JUnit 5.
mongo-spark
The MongoDB Spark Connector
pyspark-notebook-deltalake-docker
Jupyter Notebook Docker with Spark and DeltaLake support
spark-clickhouse-connector
Spark ClickHouse Connector built on the DataSourceV2 API
streaming-at-scale
How to implement a streaming at scale solution in Azure
winutils
Windows binaries for Hadoop versions (built from the git commit ID used for the ASF release)