云尘's repositories
sparkStreamingKafkaPerformance
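Uses Spark to load HDFS data into Kafka at high throughput, then consumes it at high speed with Spark Streaming / Structured Streaming. The focus is performance; suggestions for performance or code optimization are welcome.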
kudu-learning
Kudu learning materials, plus integration with Spark and Impala.
cloudera-csd
Cloudera Manager CSDs for Flink, Ignite, Tomcat, Java, etc.
troubleshooting-and-optimization
Notes on troubleshooting and performance tuning from day-to-day work.
bigdataMonitor
Monitors application status through the Cloudera Manager, ResourceManager, and Flink APIs and outputs JSON to Splunk.
flume-kudu-sink
Custom development of the Flume sink for writing to Kudu, adding support for custom primary keys.
ProcessorForNIFI
Custom processor development for NiFi, with simple usage examples.
flink-forward-china-2018
Flink Forward China 2018 Slides
kuduStreaming
syslog->flume->kafka->spark streaming->kudu
spark-learning
Spark basics, reading and writing different data sources with Spark, Spark SQL programming, Spark Streaming programming, and Spark MLlib programming.
sparkStreamingTopN
Spark Streaming multithreaded Kafka consumer that writes to Kudu/MySQL.
sparkStreamingZkWatcherModifyConfig
Using a ZooKeeper watcher in Spark Streaming to change or add configuration dynamically without stopping the stream.
flinkKafkaPartitionAsignAnalysis
Code analysis of partition assignment in Flink's KafkaConsumer.
cmAutomationDeploy
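Pulls code packages from Jenkins build results, then calls Ansible to run continuous integration and delivery for a CDH cluster and the applications on it.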
davinci
Davinci is a DVaaS (Data Visualization as a Service) Platform
DBus
DBus
hosts
Mirror: https://coding.net/u/scaffrey/p/hosts/git
scala-style-guide
Databricks Scala Coding Style Guide
spark
Apache Spark
SparkStreamingToKudu
Spark Streaming from Kafka to Kudu
sparkToIgnite
Imports data from HDFS into Ignite.
streamingpro
Unify Big Data and Machine Learning.
wingcloud
wingcloud: a real-time computing (Flink) display platform based on a microservice architecture
zookeeperStudy
Learning examples for ZooKeeper watcher, lock, and selector.