WistWill's starred repositories

kuboard-press

Kuboard 是基于 Kubernetes 的微服务管理界面。同时提供 Kubernetes 免费中文教程,入门教程,最新版本的 Kubernetes v1.23.4 安装手册,(k8s install) 在线答疑,持续更新。

DataX

DataX是阿里云DataWorks数据集成的开源版本。

Language:JavaLicense:NOASSERTIONStargazers:15841Issues:451Issues:1582

God-Of-BigData

专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...

starrocks

StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries.

Language:JavaLicense:Apache-2.0Stargazers:8765Issues:213Issues:7673

seatunnel

SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.

Language:JavaLicense:Apache-2.0Stargazers:7865Issues:176Issues:3305

JSqlParser

JSqlParser parses an SQL statement and translate it into a hierarchy of Java classes. The generated hierarchy can be navigated using the Visitor Pattern

Language:JavaLicense:Apache-2.0Stargazers:5340Issues:214Issues:1486

calcite

Apache Calcite

Language:JavaLicense:Apache-2.0Stargazers:4570Issues:168Issues:0

chunjun

A data integration framework

Language:JavaLicense:Apache-2.0Stargazers:3983Issues:168Issues:1209

dinky

Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.

Language:JavaLicense:Apache-2.0Stargazers:3087Issues:38Issues:1420

LarkMidTable

LarkMidTable 是一站式开源的数据中台,实现中台的 基础建设,数据治理,数据开发,监控告警,数据服务,数据的可视化,实现高效赋能数据前台并提供数据服务的产品。

Language:JavaLicense:Apache-2.0Stargazers:1806Issues:65Issues:59

queryparser

Parsing and analysis of Vertica, Hive, and Presto SQL.

Language:HaskellLicense:MITStargazers:1074Issues:57Issues:36

hadoop_study

定期更新Hadoop生态圈中常用大数据组件文档 重心依次为: Flink Solr Sparksql ES Scala Kafka Hbase/phoenix Redis Kerberos (项目包含hadoop思维导图 印象笔记 Scala版本简单demo 常用工具类 去敏后的train code 持续更新!!!)

flink_second_understand

该仓库专注于让读者秒懂Flink组件,包含Flink实战代码和文档、200个Flink教程知识点,Flink Datastream、Flink Table、Flink Window、Flink State、Flink Checkpoint、Flink Metrics、Flink Memory、Flink on standalone /yarn/k8s、Flink SQL、Flink CEP、Flink CDC、Flink UDF、PyFlink、Flink新特性、Flink Partition、Flink Memory等知识点。详细链接请看:https ://mp.weixin.qq.com/mp /appmsgalbum?__biz=Mzg5NDY3NzIwMA==&action=getalbum&album_id=2038088622687469575#wechat_redirect

Language:JavaLicense:Apache-2.0Stargazers:679Issues:11Issues:0

study

Java全栈工程师学习笔记;Spring、shiro、CAS、oauth2单点登录;cache 、Redis; web 安全及解决思路;redis、mq、quartz、docker;Docker各种组件实践等;mybatis、spring、spring boot实践;分布式锁;基于分库分表等等;Java full-stack engineer study notes; Spring, shiro, CAS, oauth2 single sign-on; cache, Redis; web security and solutions; redis, mq, quartz, docker; Docker various component practices, etc.;

data-warehouse-learning

【2024最新版】 大数据 数据分析 电商系统 实时数仓 离线数仓 数据湖 建设方案及实战代码,涉及组件 #flink #paimon #doris #seatunnel #dolphinscheduler #datart #dinky #hudi #iceberg。

Language:JavaLicense:Artistic-2.0Stargazers:392Issues:4Issues:0

BigData-Getting-Started

大数据相关框架实战项目(Hadoop, Spark, Storm, Flink)

Language:JavaStargazers:295Issues:7Issues:0

vue-metadata

元数据管理系统(leap metadata)是用于企业内进行元数据管理的Web应用程序。该系统提供直观的用户页面。用户可以使用该系统管理企业所拥有的元数据,如查看元数据基本信息、数据所在位置、数据的血缘关系等。本系统支持管理员和普通用户的分级操作。

Language:VueStargazers:278Issues:9Issues:0

spark-atlas-connector

A Spark Atlas connector to track data lineage in Apache Atlas

Language:ScalaLicense:Apache-2.0Stargazers:264Issues:20Issues:168

dataCompare

big data comparison and data profiling platform: low code,data comparison and data profiling

Language:JavaLicense:Apache-2.0Stargazers:252Issues:7Issues:7

jiron-cloud

该项目整合了多款优秀的开源产品,构建了一个功能全面的数据开发平台。平台提供了强大的数据集成、数据开发、数据查询、数据服务、数据质量管理、工作流调度和元数据管理功能。#dinky #dolphinscheduler #datavines #flinkcdc #openmetadata #flink #数据开发 #数据平台 # 数据开发平台 #大数据

Language:JavaLicense:Apache-2.0Stargazers:215Issues:5Issues:1

neo4j-etl

Data import from relational databases to Neo4j.

Language:HTMLLicense:NOASSERTIONStargazers:214Issues:38Issues:95

dolphinscheduler-website

Apache DolphinScheduler website

Language:JavaScriptLicense:Apache-2.0Stargazers:133Issues:21Issues:0

data-lineage-parent

数据血缘,Hive/Sqoop/HBase/Spark等,发送到kafka后,解析处理使用neo4j生成血缘

metadata-hive-hook

Hive hook, obtain task information from Hive, fetch input/output tables and lineage information from HSQL.

Language:JavaLicense:Apache-2.0Stargazers:39Issues:3Issues:3

seatunnel-example

seatunnel plugin developing examples.

metacat

Data repository software that helps researchers preserve, share, and discover data

Language:JavaLicense:GPL-2.0Stargazers:26Issues:16Issues:1661

spark-atlas-listener

Spark SQL 元数据管理

spring-boot-docker

docker部署Spring Boot应用

Language:JavaStargazers:3Issues:1Issues:0

Ryven

Flow-based visual scripting for Python

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

sql_analysis

基于jsqlparser 进行sql语句解析,得到字段间的血缘关系(即所查询字段来源于哪张表中的哪个字段)

Language:JavaStargazers:1Issues:1Issues:0