shursulei's repositories
feishu_api_data
飞书api入仓
hudi-application
hudi代码开发
hudi
Upserts, Deletes And Incremental Processing on Big Data.
dolphinscheduler
Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available out of box.
bahir-flink
Mirror of Apache Bahir Flink
dlink
Dinky is an out of the box one-stop real-time computing platform dedicated to the construction and practice of Unified Batch & Streaming and Unified Data Lake & Data Warehouse. Based on Apache Flink, Dinky provides the ability to connect many big data frameworks including OLAP and Data Lake.
scaleph
Open data platform based on flink. Now scaleph is supporting data integration with seatunnel on flink
dataease
人人可用的开源数据可视化分析工具。
flinkful
flink endpoint for open world
grafana
The open and composable observability and data visualization platform. Visualize metrics, logs, and traces from multiple sources like Prometheus, Loki, Elasticsearch, InfluxDB, Postgres and many more.
inlong
Apache InLong - a one-stop integration framework for massive data
datart
Datart is a next generation Data Visualization Open Platform
hudi-resources
汇总Apache Hudi相关资料
incubator-shenyu-dashboard
Apache ShenYu Dashboard
incubator-shenyu
ShenYu is High-Performance Java API Gateway.
incubator-seatunnel
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
MetaSpore
A unified end-to-end machine intelligence platform
nifi
Apache NiFi
flowman
Flowman is a Spark based data build tool. By using high level flow specifications with YAML files, Flowman simplifies the development of data pipelines.
hue
Open source SQL Query Assistant service for Databases/Warehouses
luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
mlflow
Open source platform for the machine learning lifecycle
bitmapudf
hive udf 读写存储到hbase的roaringbitmap
DataSphereStudio
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Prophecis
Prophecis is a one-stop cloud native machine learning platform.
hop
Hop Orchestration Platform
airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
meltano
ELT for the DataOps era- open source data integration tool. This is a read-only mirror of https://gitlab.com/meltano/meltano
OpenMetadata
Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.