chenliang's repositories
alluxio
Alluxio, data orchestration for analytics and machine learning in the cloud
alluxio-extensions
Alluxio Extensions
amoro
Amoro is a Lakehouse management system built on open data lake formats.
arrow
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
centos-script
🎉 Tool installation scripts for CentOS, including base environment configuration, Gitlab, Docker, LDAP, MongoDB, MySQL, RabbitMQ, Supervisor, Node, Python, zsh, rar, zabbix, k8s, prometheus, grafana, and more 🎉
ClickHouse
ClickHouse® is a free analytics DBMS for big data
delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs for Scala, Java, Rust, Ruby, and Python.
doris
Apache Doris is an easy-to-use, high performance and unified analytics database.
flink
Apache Flink
flink-sql-lineage
FlinkSQL field lineage solution and source code. The core idea is to parse SQL through Calcite to generate a RelNode tree of relational expressions, then obtain the optimized logical plan through the optimization stage, and finally call Calcite RelMetadataQuery to get the lineage relationship at the field level.
fucking-algorithm
Solving algorithm problems is all about patterns, and labuladong is all you need! English version supported! Crack LeetCode, not only how, but also why.
hadoop
Apache Hadoop
hudi
Upserts, Deletes And Incremental Processing on Big Data.
iceberg
Apache Iceberg
incubator-kyuubi
Apache Kyuubi is a distributed multi-tenant JDBC server for large-scale data processing and analytics, built on top of Apache Spark
internal_reference
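Latest 2021 summary of technical interview questions from Alibaba, Tencent, Baidu, Meituan, Toutiao, and others, with answers and compiled analysis from expert question setters.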
kubernetes-handbook
Kubernetes Chinese Guide / Cloud Native Application Architecture Practice Handbook - https://jimmysong.io/kubernetes-handbook
kylin
Apache Kylin
llama.cpp
LLM inference in C/C++
masterspringboot
Source code for www.masterspringboot.com
OAP
Optimized Analytics Package for Spark* Platform
parquet-format
Apache Parquet
parquet-mr
Mirror of Apache Parquet
spark
Apache Spark - A unified analytics engine for large-scale data processing
spring-security-oauth
Support for adding OAuth1(a) and OAuth2 features (consumer and provider) for Spring web applications.
stable-diffusion-webui
Stable Diffusion web UI
tempto
A testing framework for Trino
the-algorithm
Source code for Twitter's Recommendation Algorithm
trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)