sogn's repositories
Elasticsearch-Hbase
elasticsearch+hbase海量数据查询,支持千万数据秒回查询
flink
Apache Flink
hadoop-mr
hadoop配置文件及基本实现
HBaseObserver
通过HBase Observer同步数据到ElasticSearch
incubator-dolphinscheduler
Dolphin Scheduler is a distributed and easy-to-extend visual workflow scheduling platform, dedicated to solving the complex dependencies in data processing, making the scheduling system out of the box for data processing.(分布式易扩展的可视化工作流任务调度)
incubator-hudi
Upserts, Deletes And Incremental Processing on Big Data.
kafka-spark-consumer
High Performance Kafka Consumer for Spark Streaming. Compatible with every Spark and Kafka versions including latest Spark 2.3.x and Kafka 2.0.0. Now supports Kafka Security,Kafka Headers. Offset management in Zookeeper. Reliable No Data-loss guarantee. No dependency on HDFS and WAL. In-built PID rate controller. Support Message Interceptor . Offset Lag checker.
kafka-spark-streaming-example
Simple examle for Spark Streaming over Kafka topic
kraps-rpc
A RPC framework leveraging Spark RPC module
python_spider_demo
爬虫小例子
useful-scripts
🐌 useful scripts for making developer's everyday life easier and happier, involved java, shell etc.