Lubo Zhang's repositories
openTSDB-client
A Java library that implements the synchronized http api for putting metrics and querying data from the OpenTSDB server.
ABRiS
Avro SerDe for Apache Spark structured APIs.
akka-quartz-scheduler
Quartz Extension and utilities for cron-style scheduling in Akka
AthenaX
SQL-based streaming analytics platform at scale
canal
阿里巴巴mysql数据库binlog的增量订阅&消费组件 。阿里云DRDS( https://www.aliyun.com/product/drds )、阿里巴巴TDDL 二级索引、小表复制powerd by canal.
CoolplaySpark
酷玩 Spark: Spark 源代码解析、Spark 类库等
EasyML
Easy Machine Learning is a general-purpose dataflow-based system for easing the process of applying machine learning algorithms to real world tasks.
example-spark-kafka
Apache Spark and Apache Kafka integration example
flink
Apache Flink
HiBench
HiBench is a big data benchmark suite.
hudi
Spark Library for Hadoop Upserts And Incrementals
interview_internal_reference
2019年最新总结,阿里,腾讯,百度,美团,头条等技术面试题目,以及答案,专家出题人分析汇总。
IQL
An ad hoc query service based on the spark sql engine.(基于spark sql引擎的即席查询服务)
kafka-connect-hdfs
Kafka Connect HDFS connector
kafka-spark-consumer
High Performance Kafka Consumer for Spark Streaming. Compatible with every Spark and Kafka versions including latest Spark 2.2.0 and Kafka 0.11.0. Now supports Kafka Security. Offset management in Zookeeper. Reliable No-Dataloss gurantee. No dependency on HDFS or Checkpointing and WAL. In-built PID rate controller. Support Message Interceptor . Offset Lag checker.
ksql
KSQL - the Streaming SQL Engine for Apache Kafka
mlflow
Open source platform for the complete machine learning lifecycle
movie-recommender-demo
This project walks through how you can create recommendations using Apache Spark machine learning. There are a number of jupyter notebooks that you can run on IBM Data Science Experience, and there a live demo of a movie recommendation web application you can interact with. The demo also uses IBM Message Hub (kafka) to push application events to topic where they are consumed by a spark streaming job running on IBM BigInsights (hadoop).
OAP
Optimized Analytics Package for Spark Platform
Quicksql
Simpler, Safer, Faster Unified SQL Analytics Engine for Multi-Datasources
Rong360
用户贷款风险预测
ServiceFramework
Java MVC framework, agile, fast, rich domain model, made especially for server side of mobile application (一个敏捷,快速,富领域模型的Java MVC 框架,专为 移动应用后端量身定做)
spark
Mirror of Apache Spark
spark-kafka-writer
Write your Spark data to Kafka seamlessly
spark-structured-streaming-book
Notes about Structured Streaming in Apache Spark