david.zyw's repositories

aws-sdk-java

The official AWS SDK for Java.

License:Apache-2.0Stargazers:0Issues:0Issues:0

banzai-charts

Curated list of Banzai Cloud Helm charts used by the Pipeline Platform

Language:MustacheLicense:Apache-2.0Stargazers:0Issues:0Issues:0

bigdata-platform-on-k8s

deploy bigdata platform on kubernetes

Language:ShellStargazers:0Issues:0Issues:0

cube-studio

Cloud native one-stop machine learning platform, Multi-user, Dataleap, Notebook, Drag-and-Drop pipeline, Multi-machine multi-gpu distributed training, Automl, Inference, Edge computing, Federation schedule, Real time training, large models, AIhub

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0

datahub

The Metadata Platform for the Modern Data Stack

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

dinky

Dinky is an out of the box one-stop real-time computing platform dedicated to the construction and practice of Unified Streaming & Batch and Unified Data Lake & Data Warehouse. Based on Apache Flink, Dinky provides the ability to connect many big data frameworks including OLAP and Data Lake.

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

docusaurus

Easy to maintain open source documentation websites.

Language:TypeScriptLicense:MITStargazers:0Issues:0Issues:0

flink

Apache Flink

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

flink-connector-elasticsearch

Apache Flink connector for ElasticSearch

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

flink-docker

Docker packaging for Apache Flink

Language:ShellLicense:Apache-2.0Stargazers:0Issues:0Issues:0

flink-kubernetes-operator

Apache Flink Kubernetes Operator

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

hadoop

Apache Hadoop

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

iperf-jperf

Improvements to jperf, a Java interface to the iperf network throughput testing suite

Language:JavaStargazers:0Issues:0Issues:0

presto-on-k8s

Deploying Presto on K8S as a cloud OLAP Serviceļ¼Œ dynamic scaling based on HPA

Language:ShellStargazers:0Issues:0Issues:0

spark

Apache Spark - A unified analytics engine for large-scale data processing

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

spark-on-k8s-operator

Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.

Language:GoLicense:Apache-2.0Stargazers:0Issues:0Issues:0

superset-2.1.0rc1

Apache Superset is a Data Visualization and Data Exploration Platform

Language:TypeScriptLicense:Apache-2.0Stargazers:0Issues:0Issues:0

talk-demos

Code & docs for Pipekit's talks

Language:PythonStargazers:0Issues:0Issues:0

transporter

Sync data between persistence engines, like ETL only not stodgy

Language:GoLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0