Daniel Jeon's repositories
data-science-study-bookmarks-for-korean
A collection of bookmarks for studying data science (for Koreans)
docker-logstash-alpine
Alpine Linux-based Logstash Docker image
alluxio
Alluxio, data orchestration for analytics and machine learning in the cloud
arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.
awesome-production-machine-learning
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
beeva-best-practices
Best Practices and Style Guides in BEEVA
docker-spark
Apache Spark Docker image
dr-elephant
Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark
incubator-superset
Apache Superset (incubating) is a modern, enterprise-ready business intelligence web application
logstash-input-pulsar
Pulsar input for Logstash
logstash-output-pulsar-1
Logstash output plugin for Apache Pulsar (using the Kafka-compatible interface); ugly, but it works.
macOS-thunderbolt3-enable
Enable Thunderbolt 3 devices (Windows-only) on macOS 10.15.x (Catalina)
parquet-mr
Apache Parquet
perf-tools
Performance analysis tools based on Linux perf_events (aka perf) and ftrace
pulsar-manager
A tool for managing Apache Pulsar.
purge-wrangler
AMD & NVIDIA eGPUs for all Thunderbolt Macs.
spark
Apache Spark - A unified analytics engine for large-scale data processing
tb3-enabler
Enable Thunderbolt 3 for unsupported peripherals on macOS
Thunderbolt3Unblocker
Enable unsupported Thunderbolt 3 peripherals on macOS