Shaofeng Shi (shaofengshi)

shaofengshi

Geek Repo

Company:Datastrato Inc

Location:Sunnyvale CA

Github PK Tool:Github PK Tool

Shaofeng Shi's starred repositories

996.ICU

Repo for counting stars and contributing. Press F to pay respect to glorious developers.

License:NOASSERTIONStargazers:269219Issues:4222Issues:0

beego

beego is an open-source, high-performance web framework for the Go programming language.

Language:GoLicense:NOASSERTIONStargazers:31012Issues:1191Issues:3323

soar

SQL Optimizer And Rewriter

Language:GoLicense:Apache-2.0Stargazers:8605Issues:279Issues:237

delta

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Language:ScalaLicense:Apache-2.0Stargazers:6984Issues:216Issues:1391

log.io

Real-time log monitoring in your browser

Language:TypeScriptLicense:NOASSERTIONStargazers:4805Issues:249Issues:222

flink-training-course

Flink 中文视频课程(持续更新...)

kylin

Apache Kylin

Language:JavaLicense:Apache-2.0Stargazers:3614Issues:259Issues:0

CBoard

An easy to use, self-service open BI reporting and BI dashboard platform.

Language:JavaScriptLicense:Apache-2.0Stargazers:3015Issues:275Issues:600

dinky

Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.

Language:JavaLicense:Apache-2.0Stargazers:2876Issues:38Issues:1293

ice

AWS Usage Tool

migrate

Database migrations. CLI and Golang library.

Language:GoLicense:NOASSERTIONStargazers:2292Issues:46Issues:195

gobblin

A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.

Language:JavaLicense:Apache-2.0Stargazers:2198Issues:166Issues:0

byzer-lang

Byzer (former MLSQL): A low-code open-source programming language for data pipeline, analytics and AI.

Language:ScalaLicense:Apache-2.0Stargazers:1821Issues:117Issues:586

Github-Monitor

Github Sensitive Information Leakage Monitor(Github信息泄漏监控系统)

Language:JavaScriptLicense:GPL-3.0Stargazers:1630Issues:47Issues:115

bitsail

BitSail is a distributed high-performance data integration engine which supports batch, streaming and incremental scenarios. BitSail is widely used to synchronize hundreds of trillions of data every day.

Language:JavaLicense:Apache-2.0Stargazers:1592Issues:61Issues:212

sqle

一个支持多种不同类型数据库,覆盖事前控制、事后监督、标准发布场景,帮助您建立质量规范的SQL全生命周期质量管理平台

Language:GoLicense:MPL-2.0Stargazers:1346Issues:32Issues:818

hue

Open source SQL Query Assistant service for Databases/Warehouses

Language:JavaScriptLicense:Apache-2.0Stargazers:1105Issues:36Issues:1221

gluten

Gluten: Plugin to Double SparkSQL's Performance

Language:ScalaLicense:Apache-2.0Stargazers:920Issues:31Issues:1303

flintrock

A command-line tool for launching Apache Spark clusters.

Language:PythonLicense:Apache-2.0Stargazers:633Issues:32Issues:209
Language:ScalaLicense:Apache-2.0Stargazers:568Issues:364Issues:61

onetable

OneTable is an omni-directional converter for table formats that facilitates interoperability across data processing systems and query engines.

Language:JavaLicense:Apache-2.0Stargazers:562Issues:18Issues:144

dockbix-xxl

:whale: Dockerized Zabbix - server, web, proxy, java gateway, snmpd with additional extensions

Language:JavaScriptLicense:GPL-2.0Stargazers:378Issues:40Issues:98

delight

A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.

Language:ScalaLicense:NOASSERTIONStargazers:339Issues:16Issues:13

map-viz

通用的地图可视化组件

Firestorm

Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shuffle data on remote servers

Language:JavaLicense:NOASSERTIONStargazers:249Issues:12Issues:48
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:189Issues:14Issues:22

RaftKeeper

RaftKeeper is a high-performance distributed consensus service.

Language:C++License:Apache-2.0Stargazers:105Issues:5Issues:105

snowplow-web-data-model

SQL data model for working with Snowplow web data. Supports Redshift and Looker. Snowflake and BigQuery coming soon

redash-kylin

Redash plugin for Apache Kylin integration

Language:PythonLicense:BSD-2-ClauseStargazers:12Issues:7Issues:1

Kylin-on-Amazon-EMR

Quick deployment of Apache Kylin on Amazon EMR

Language:ShellStargazers:9Issues:3Issues:0