MOBIN's repositories
TravelPriceComparison
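Travel price comparison and decision system (third-prize entry in the National Cloud Computing Application Innovation Competition)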
BigDataLearning
A collection of examples from learning Spark (with detailed code comments)
Binlog2Hive
Real-time synchronization of incremental MySQL data to HDFS/Hive
CollectorProject
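Collector: checks the collection directory every two minutes for new files, collects any new files into HDFS, and marks collected files to prevent duplicate collection (a good example for learning multithreaded concurrency)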
CoolplaySpark
Coolplay Spark: Spark source code analysis, Spark libraries, and more
hadoop-2.5.2
1. HDFS source code analysis; code comments are based on the book 《Hadoop2.x HDFS源码剖析》 (Hadoop 2.x HDFS Source Code Analysis)
iceberg-spark-tpcds-benchmark
iceberg-spark-tpcds-benchmark
CSVToExcel
Converts CSV files into Excel two-dimensional tables and pivot tables
MOBINMetrics
A monitoring platform based on VictoriaMetrics on K8s
iceberg
Apache Iceberg
Algorithms
Algorithms and LeetCode
apollo
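Apollo is a configuration management platform developed by Ctrip's framework department. It centrally manages application configuration across different environments and clusters, pushes configuration changes to applications in real time, and provides standardized permission and workflow governance.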
doris-flink-connector
Flink Connector for Apache Doris
flink-connector-elasticsearch
Apache Flink connector for Elasticsearch
flink-connector-hbase
Apache Flink
flink-table-store
An Apache Flink subproject to provide storage for dynamic tables.
hudi
Upserts, Deletes And Incremental Processing on Big Data.
iceberg-docs
Apache Iceberg Documentation Site
spark-tpcds-datagen
All the things about TPC-DS in Apache Spark
SparkInternals
Notes talking about the design and implementation of Apache Spark
streamx
Make stream processing easier! A Flink & Spark development scaffold. The original intention of StreamX is to make the development of Flink easier. StreamX focuses on the management of development phases and tasks. Our ultimate goal is to build a one-stop big data solution integrating stream processing, batch processing, data warehouse and data lake.