Genmao Yu's repositories
ps-on-spark
A fork of Apache Spark integrated with a simplified parameter server to support large-scale model training.
aliyun-emapreduce-sdk
Spark on Aliyun, supporting interactions with Aliyun's base services.
flinkStreamSQL
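Built on open-source Flink, this project extends Flink's real-time SQL; it mainly implements joins between streams and dimension tables, and supports all native Flink SQL syntax.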
incubator-celeborn
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
incubator-gobblin
Gobblin is a distributed big data integration framework (ingestion, replication, compliance, retention) for batch and streaming systems. Gobblin features integrations with Apache Hadoop, Apache Kafka, Salesforce, S3, MySQL, Google etc.
librdkafka
The Apache Kafka C/C++ library
openapi-sdk-php
The OpenAPI SDK for PHP with Composer support
schema-registry-ui
Web tool for Avro Schema Registry
spark-hive-streaming-sink
A sink to save Spark Structured Streaming DataFrame into Hive table
spark-structured-streaming-jdbc-sink
Spark Structured Streaming JDBC Sink
tranquility
Tranquility helps you send real-time event streams to Druid and handles partitioning, replication, service discovery, and schema rollover, seamlessly and without downtime.
useful-scripts
🐌 useful scripts for making developer's everyday life easier and happier
waggle-dance
Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.