smokeriu's repositories
clash-rule
A repository to store clash rule/config
compute-platform
wapper compute-platform by spark/flink
delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
dlink
Dinky is an out of the box one-stop real-time computing platform dedicated to the construction and practice of Unified Batch & Streaming and Unified Data Lake & Data Warehouse. Based on Apache Flink, Dinky provides the ability to connect many big data frameworks including OLAP and Data Lake.
doris-spark-connector
Spark Connector for Apache Doris
elasticsearch-hadoop
:elephant: Elasticsearch real-time search and analytics natively integrated with Hadoop
flink-java-demo
Flink Demo with Java
hbase-connectors
Apache HBase Connectors
hudi
Upserts, Deletes And Incremental Processing on Big Data.
iceberg
Apache Iceberg
incubator-amoro
Apache Amoro (incubating) is a Lakehouse management system built on open data lake formats.
incubator-livy
Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.
incubator-paimon
Apache Paimon(incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking and efficient real-time analytics.
incubator-seatunnel
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
inlong
Apache InLong - a one-stop integration framework for massive data
kyuubi
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
leetcode
leetcode做题记录。之前的记录再OneNote上,不过多年下来发现OneNote并不适合记录leetcode这类问题
linkis
Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.
metabase
The simplest, fastest way to get business intelligence and analytics to everyone in your company :yum:
nebula-algorithm
Nebula-Algorithm is a Spark Application based on GraphX, which enables state of art Graph Algorithms to run on top of NebulaGraph and write back results to NebulaGraph.
Obsidian-notes
Used for synchronizing Obsidian notes
ripgrep
ripgrep recursively searches directories for a regex pattern while respecting your gitignore
spark
Apache Spark - A unified analytics engine for large-scale data processing
spark-clickhouse-connector
Spark ClickHouse Connector build on DataSourceV2 API
spark-jobserver
REST job server for Apache Spark
ssiuAliyunController
my AliyunController