云尘's starred repositories
KnowStreaming
一站式云原生实时流数据平台,通过0侵入、插件化构建企业级Kafka服务,极大降低操作、存储和管理实时流数据门槛
BigData-Interview
:dart: :star2:[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结
dl-on-flink
Deep Learning on Flink aims to integrate Flink and deep learning frameworks (e.g. TensorFlow, PyTorch, etc) to enable distributed deep learning training and inference on a Flink cluster.
tensorflow-handbook
简单粗暴 TensorFlow 2 | A Concise Handbook of TensorFlow 2 | 一本简明的 TensorFlow 2 入门指导教程
kudu-learning
kudu学习的一些资料,以及和spark/impala的集成使用
spark-structured-streaming-internals
The Internals of Spark Structured Streaming
git-history
Quickly browse the history of a file from any git repository
kubernetes
Production-Grade Container Scheduling and Management
bahir-flink
Mirror of Apache Bahir Flink
datasource_architecture
追源索骥-flink
flinkStreamSQL
基于开源的flink,对其实时sql进行扩展;主要实现了流与维表的join,支持原生flink SQL所有的语法
flink-training-course
Flink 中文视频课程(持续更新...)
UserActionAnalyzePlatform
电商用户行为分析大数据平台
byzer-lang
Byzer (former MLSQL): A low-code open-source programming language for data pipeline, analytics and AI.