lizu18xz's repositories

web-spark

SpringBoot + Spark

Language:ScalaStargazers:3Issues:2Issues:0

Addax

Addax(formerly DataX) is an open source universal ETL tool that support Cassandra, ClickHouse, DBF, Hive, InfluxDB, Kudu, MySQL, Oracle, Presto(Trino), PostgreSQL, SQL Server.

Language:JavaLicense:Apache-2.0Stargazers:1Issues:1Issues:0

bootcamp

Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

calcite-for-sql

使用calcite解析sql语句,输出sql中各个部分。可用于sql血缘分析。

Stargazers:1Issues:0Issues:0
Stargazers:1Issues:0Issues:0

dexecutor-core

Execute Dependent/Independent tasks in a reliable way

Language:JavaLicense:Apache-2.0Stargazers:1Issues:1Issues:0

dolphinscheduler

Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available out of box.

Language:JavaLicense:Apache-2.0Stargazers:1Issues:1Issues:0

examples-scala

Stream Processing with Apache Flink - Scala Examples

License:Apache-2.0Stargazers:1Issues:0Issues:0

fim

简单的消息推送系统

Language:JavaStargazers:1Issues:2Issues:0

flinkful

flink endpoint for open world

Language:JavaLicense:Apache-2.0Stargazers:1Issues:0Issues:0

incubator-hugegraph

A graph database that supports more than 100+ billion data, high performance and scalability (Include OLTP Engine & REST-API & Backends)

License:Apache-2.0Stargazers:1Issues:0Issues:0

incubator-seatunnel

SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).

Language:JavaLicense:Apache-2.0Stargazers:1Issues:0Issues:0

intelligence

学习ES

Language:JavaStargazers:1Issues:2Issues:0

miaoshaStable

学习一些高并发的技巧

Language:JavaStargazers:1Issues:0Issues:0

mlsql

The Programming Language Designed For Big Data and AI

Language:JavaScriptLicense:Apache-2.0Stargazers:1Issues:1Issues:0

scaleph

Open data platform based on flink and kubernetes. Now scaleph is supporting data integration with seatunnel on flink

Language:JavaLicense:Apache-2.0Stargazers:1Issues:0Issues:0

spark

Apache Spark - A unified analytics engine for large-scale data processing

License:Apache-2.0Stargazers:1Issues:0Issues:0

spark-extend-dataSource

spark自定义外部数据源demo

Language:ScalaStargazers:1Issues:1Issues:0

spark-field-lineage

spark 字段血缘 spark field lineage

Stargazers:1Issues:0Issues:0
Language:DockerfileStargazers:1Issues:2Issues:0

spark-tpcds-datagen

All the things about TPC-DS in Apache Spark

License:Apache-2.0Stargazers:1Issues:0Issues:0

sql-parser

基于antlr4的sql解析,实现格式化,元数据,血源等自定义解析,包括hive

Language:ANTLRLicense:Apache-2.0Stargazers:1Issues:0Issues:0

sqlSubmit

基于 Flink 的 sqlSubmit 程序

Language:JavaLicense:GPL-3.0Stargazers:1Issues:0Issues:0

ssc

学习 自己实现一个简单的配置中心

Language:JavaStargazers:1Issues:2Issues:0
Stargazers:1Issues:0Issues:0

big-data-algorithm

一些算法包

Stargazers:0Issues:0Issues:0

datalake-example

Data lake implementation demo, include iceberg on flink, iceberg on spark, hudi on flink, hudi on spark

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:DockerfileStargazers:0Issues:0Issues:0

transition-ticket

B站会员购 抢票脚本

License:GPL-3.0Stargazers:0Issues:0Issues:0