Xiaojian Sun (sunxiaojian)

sunxiaojian

Geek Repo

Location:BeiJing,China

Github PK Tool:Github PK Tool

Xiaojian Sun's repositories

airbyte

Data integration platform for ELT pipelines from APIs, databases & files to warehouses & lakes.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

airbyte-platform

The platform that powers Airbyte. Please file issues in https://github.com/airbytehq/airbyte

Language:JavaLicense:NOASSERTIONStargazers:0Issues:0Issues:0

debezium

Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

flink

Apache Flink

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

paimon

Apache Paimon(incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking and efficient real-time analytics.

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

seatunnel

SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

amoro

Arctic is a streaming lake warehouse service open sourced by NetEase

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

datafusion

Apache DataFusion SQL Query Engine

Language:RustLicense:Apache-2.0Stargazers:0Issues:0Issues:0

DB-GPT

AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

elasticsearch

Free and Open, Distributed, RESTful Search Engine

Language:JavaLicense:NOASSERTIONStargazers:0Issues:0Issues:0

flink-cdc

CDC Connectors for Apache Flink®

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

License:MITStargazers:0Issues:0Issues:0

gravitino

World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

hadoop

Apache Hadoop

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

helm-java

Helm client for Java

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

iceberg

Apache Iceberg

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

kafka-connect-file-pulse

🔗 A multipurpose Kafka Connect connector that makes it easy to parse, transform and stream any file, in any format, into Apache Kafka

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

kafka-connect-paimon

kafka connect for paimon

Language:JavaLicense:Apache-2.0Stargazers:0Issues:1Issues:0

llamacoder

Open source Claude Artifacts – built with Llama 3.1 405B

Stargazers:0Issues:0Issues:0

llm-action

本项目旨在分享大模型相关技术原理以及实战经验。

License:Apache-2.0Stargazers:0Issues:0Issues:0

migration

Migration tools for TiKV, e.g. online bulk load.

License:Apache-2.0Stargazers:0Issues:0Issues:0

paimon-webui

Web ui for Apache Paimon.

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

parquet-mr

Apache Parquet

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

pinot

Apache Pinot - A realtime distributed OLAP datastore

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

polardbx-sql

PolarDB-X is a cloud native distributed SQL Database designed for high concurrency, massive storage, complex querying scenarios.

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

pravega

Pravega - Streaming as a new software defined storage primitive

License:Apache-2.0Stargazers:0Issues:0Issues:0

ranger

Apache Ranger - To enable, monitor and manage comprehensive data security across the Hadoop platform and beyond

License:Apache-2.0Stargazers:0Issues:0Issues:0

risingwave

Scalable Postgres for stream processing, analytics, and management. KsqlDB and Apache Flink alternative. 🚀 10x more productive. 🚀 10x more cost-efficient.

Language:RustLicense:Apache-2.0Stargazers:0Issues:0Issues:0

starrocks

StarRocks is a next-gen sub-second MPP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics and ad-hoc query.

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

temporal

Temporal service

License:MITStargazers:0Issues:0Issues:0