Shaofeng Shi (shaofengshi)

shaofengshi

Geek Repo

Company:Datastrato Inc

Location:Sunnyvale CA

Github PK Tool:Github PK Tool

Shaofeng Shi's repositories

emr-bootstrap-alluxio

alluxio - emr bootstrap action scripts

Language:ShellLicense:MITStargazers:4Issues:3Issues:0

kylin

Apache Kylin

Language:JavaLicense:Apache-2.0Stargazers:2Issues:1Issues:0

airflow

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Language:PythonLicense:Apache-2.0Stargazers:1Issues:1Issues:0

alluxio

Alluxio, formerly Tachyon, Unify Data at Memory Speed

Language:JavaLicense:Apache-2.0Stargazers:1Issues:0Issues:0

ApacheQuiz

A multiple choice quiz of Apache Software Foundation policy

Language:CSSLicense:Apache-2.0Stargazers:1Issues:2Issues:0

Chronicle-Map

Replicate your Key Value Store across your network, with consistency, persistance and performance.

Language:JavaLicense:Apache-2.0Stargazers:1Issues:1Issues:0

ckman

This is a tool which used to manage and monitor ClickHouse database

Language:GoLicense:Apache-2.0Stargazers:1Issues:0Issues:0

ClickHouse

ClickHouse is a free analytics DBMS for big data

Language:C++License:Apache-2.0Stargazers:1Issues:0Issues:0

datafuse

An elastic and scalable Cloud Warehouse, offers Blazing Fast Query and combines Elasticity, Simplicity, Low cost of the Cloud, built to make the Data Cloud easy

Language:RustLicense:Apache-2.0Stargazers:1Issues:1Issues:0

delta

An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.

Language:ScalaLicense:Apache-2.0Stargazers:1Issues:1Issues:0

druid

阿里巴巴数据库事业部出品,为监控而生的数据库连接池。阿里云Data Lake Analytics(https://www.aliyun.com/product/datalakeanalytics )、DRDS、TDDL 连接池powered by Druid

Language:JavaLicense:NOASSERTIONStargazers:1Issues:1Issues:0

hbase

Mirror of Apache HBase

Language:JavaLicense:Apache-2.0Stargazers:1Issues:1Issues:0

incubator

Apache Incubator Website

Language:CSSStargazers:1Issues:1Issues:0

incubator-hudi

Upserts And Incremental Processing on Big Data

Language:JavaLicense:Apache-2.0Stargazers:1Issues:0Issues:0

json-editor

JSON Schema Based Editor

Language:JavaScriptLicense:MITStargazers:1Issues:2Issues:0

moonbox

Moonbox is a DVtaaS (Data Virtualization as a Service) Platform

Language:ScalaLicense:Apache-2.0Stargazers:1Issues:1Issues:0

parquet-format

Apache Parquet

Language:JavaLicense:Apache-2.0Stargazers:1Issues:2Issues:0

parquet-mr

Apache Parquet

Language:JavaLicense:Apache-2.0Stargazers:1Issues:1Issues:0

presto

Official home of the community managed version of Presto, the distributed SQL query engine for big data, under the auspices of the Presto Software Foundation.

Language:JavaLicense:Apache-2.0Stargazers:1Issues:1Issues:0

Quicksql

Simpler, Safer, Faster Unified SQL Analytics Engine for Multi-Datasources

Language:JavaLicense:MITStargazers:1Issues:0Issues:0

RemoteShuffleService

Remote shuffle service for Apache Spark to store shuffle data on remote servers.

Language:JavaLicense:NOASSERTIONStargazers:1Issues:0Issues:0

SparkCube

SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark.

License:Apache-2.0Stargazers:1Issues:0Issues:0

tsunami-security-scanner-plugins

This project aims to provide a central repository for many useful Tsunami Security Scanner plugins.

Language:JavaLicense:Apache-2.0Stargazers:1Issues:0Issues:0

apachecon-acasia

Draft page for acah2021 conference

Language:PythonStargazers:0Issues:0Issues:0

Chat2DB

🔥 🔥 🔥 An intelligent and versatile general-purpose SQL client and reporting tool for databases which integrates ChatGPT capabilities.

License:Apache-2.0Stargazers:0Issues:0Issues:0

designing-data-intensive-applications

Designing Data-Intensive Applications by Martin Kleppmann

Stargazers:0Issues:0Issues:0

gravitino

World's most powerful data catalog service with providing a high-performance, geo-distributed and federated metadata lake.

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

hsd-cipher-sm

国产密码算法SM2,SM3,SM4

Language:JavaStargazers:0Issues:0Issues:0

Polycat

Polycat is a cutting-edge cloud-native metastore system, purpose-built to cater to the demands of modern data management in lakehouse deployments. It offers a comprehensive solution for organizations that need to manage metadata from multiple data sources across different clouds, all in one unified platform.

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0