云尘's repositories
troubleshooting-and-optimization
记录工作中的一些故障处理以及性能调优
flinkKafkaPartitionAsignAnalysis
Flink KafkaConsumer的partition分配代码分析
flume-kudu-sink
flume写入kudu的sink二次开发,增加主键自定义
spark
Apache Spark
sparkStreamingKafkaPerformance
spark将hdfs数据高性能灌入kafka,然后spark streaming/structured streaming高速消费,关注性能,欢迎提供性能/代码优化建议
wingcloud
wingcloud 基于微服务架构的实时计算(Flink)展示平台
streamingpro
Unify Big Data and Machine Learning.
flink-forward-china-2018
Flink Forward China 2018 Slides
davinci
Davinci is a DVaaS (Data Visualization as a Service) Platform
wormhole
Wormhole is a SPaaS (Stream Processing as a Service) Platform
DBus
DBus
moonbox
Moonbox is a DaaS (Data Virtualization as a Service) Platform
zookeeperStudy
zookeeper watcher、lock、selector的学习示例
sparkStreamingZkWatcherModifyConfig
zookeeper watcher 在spark streaming中的应用,用于不停流动态改变/增加配置
scala-style-guide
Databricks Scala Coding Style Guide
bigdataMonitor
通过cloudera manager/ResourceManager/Flink API来监控相应应用的状态并输出json给splunk
hosts
镜像:https://coding.net/u/scaffrey/p/hosts/git
sparkStreamingTopN
spark streaming multithreading comsumer kafka and write to kudu/mysql
cmAutomationDeploy
从jenkins构建结果中拉取代码包,然后调用ansible执行CDH集群及其上应用的持续集成与交付
kuduStreaming
syslog->flume->kafka->spark streaming->kudu
sparkToIgnite
从hdfs上导入数据到ignite
cloudera-csd
flink/ignite/tomcat/java .etc csd of cloudera manager
spark-learning
spark基础、spark读写不同数据源、spark sql编程、spark streaming编程、spark mlib编程
ProcessorForNIFI
NIFI工具中的processor定制开发,以及简单的使用
kudu
Mirror of Apache Kudu
kudu-learning
kudu学习的一些资料,以及和spark/impala的集成使用
SparkStreamingToKudu
spark streaming from kafka to kudu