rterror's repositories
bigdata-file-viewer
A cross-platform (Windows, Mac, Linux) desktop application for viewing common big data binary formats such as Parquet, ORC, and Avro. Supports the local file system, HDFS, AWS S3, Azure Blob Storage, etc.
1-DataTechNews
An ongoing record of the latest technical developments in data processing
ansible-tuto
Ansible tutorial
arthas
Alibaba Java Diagnostic Tool Arthas
bigdata-notes
Big data learning notes
english_materials
English learning materials
blaze
Blazing-fast query execution engine that speaks the Apache Spark language and has Arrow-DataFusion at its core.
brickhouse
Hive UDFs for the data warehouse
clean-code
Source code for the book "Clean Code"
compass
Compass is a task diagnosis platform for bigdata
cpp_new_features
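C++ learning materials, newly compiled in 2021, covering new features in C++ 11 / 14 / 17 / 20 / 23, beginner tutorials, recommended books, quality articles, study notes, teaching videos, and more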
docker-hive
Docker image for Apache Hive Metastore
dpkb
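A collection of big data material, including distributed storage engines, distributed compute engines, and data warehouse construction. Keywords: Hadoop, HBase, ES, Kudu, Hive, Presto, Spark, Flink, Kylin, ClickHouse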
duckdb
DuckDB is an in-process SQL OLAP Database Management System
effectivescala
Twitter's Effective Scala Guide
flink-learning
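Flink learning blog. http://www.54tianzhisheng.cn/ Covers Flink basics, concepts, principles, hands-on practice, performance tuning, and source code analysis. Includes learning examples for Flink Connector, Metrics, Library, DataStream API, and Table API & SQL, plus case studies of large production Flink projects (PV/UV, log storage, real-time deduplication of tens of billions of records, monitoring and alerting). You are welcome to support my column 《大数据实时计算引擎 Flink 实战与性能优化》 ("Big Data Real-Time Computing Engine: Flink in Practice and Performance Optimization").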
God-Of-BigData
Focused on big data learning and interview preparation; the road to big data mastery starts here. Flink/Spark/Hadoop/Hbase/Hive...
hawq
Apache HAWQ
hive-third-functions
Some useful custom Hive UDFs, especially array, JSON, math, and string functions.
hudi-resources
A collection of Apache Hudi resources
incubator-seatunnel
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
jvm-profiler
JVM Profiler Sending Metrics to Kafka, Console Output or Custom Reporter
p3c
Alibaba Java Coding Guidelines PMD implementation and IDE plugin
SoftwareArchitect
Path to a Software Architect
spark-rapids
Spark RAPIDS plugin - accelerate Apache Spark with GPUs
spark-scala-examples
This project provides Apache Spark SQL, RDD, DataFrame, and Dataset examples in the Scala language
velox
A C++ vectorized database acceleration library aimed at optimizing query engines and data processing systems.
waggle-dance
Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.