guhaitao's repositories
BigData_AutomaticDeploy
大数据自动化部署,包括自动化部署hadoop、hive、hbase、spark、storm等等一系列组件
canal
阿里巴巴 MySQL binlog 增量订阅&消费组件
datasqueeze
Hadoop utility to compact small files
DataX
DataX是阿里云DataWorks数据集成的开源版本。
dolphinscheduler
Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available out of box.
filecrush
Remedy small files by combining them into larger ones.
guhaitao
Config files for my GitHub profile.
hadoop-lzo
Refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20
hadoop_exporter
A hadoop exporter for prometheus, scrape hadoop metrics (including HDFS, YARN, MAPREDUCE, HBASE. etc.) from hadoop components jmx url.
hadoop_jmx_exporter
HDFS & YARN jmx metrics prometheus exporter
hdfsutils
hdfs文件治理工具,文件批量解压、压缩、小文件合并
hive-phoenix-handler
hive-phoenix-handler is a hive plug-in that can access Apache Phoenix table on HBase using HiveQL.
hive-third-functions
Some useful custom hive udf functions, especial array, json, math, string functions.
learning-spark
Example code from Learning Spark book
nagios-plugins
Collection of some handy Nagios plugins
pentaho-kettle
Pentaho Data Integration ( ETL ) a.k.a Kettle
Shell_Script
Linux系统的安全,通过脚本对Linux系统进行一键检测和一键加固
Streamis
Streaming application development and management system, based on Linkis and DSS, planning to provide the workflow-like graphical drag-and-drop development capability.
Synonyms
中文近义词工具包
YanX
研招网硕士专业目录下载;考研专业目录下载,招生人数,考试科目,考研专业,考研院校,A、B类地区,211、985、双一流;