HuangWei's repositories
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
airflow-site
Apache Airflow Website
dolphinscheduler
Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available out of box.
febench
A Benchmark for Real-Time Relational Data Feature Extraction (VLDB'23 Best Industry Paper Runnerup)
HybridSE
A Hybird SQL Engine based on LLVM for hatp, olap, oltp, mpp, sparksql and flinksql
hybridsql-asserts
HybridSQL third-party libraries
incubator-brpc
Industrial-grade RPC framework used throughout Baidu, with 1,000,000+ instances and thousands kinds of services. "brpc" means "better RPC".
incubator-doris
Apache Doris (Incubating)
kafka-connect-jdbc
Kafka Connect connector for JDBC-compatible databases
openmldb-compose
OpenMLDB compose cluster with hdfs
openmldb-exporter
OpenMLDB metric exporter for Prometheus
pegasus
A distributed key-value storage system developed and maintained by Xiaomi Cloud Storage Team.
pulsar
Apache Pulsar - distributed pub-sub messaging system
pulsar-client-java
Pulsar producer which send JSON message
spark
This is OpenMLDB's Spark Distribution, which is particularly optimized for feature extraction. It includes a few novel techniques, such as native implementation of last join and multi-window parallelization. Its APIs are fully compatible with the standard Spark. It is designed to be a component of OpenMLDB (https://github.com/4paradigm/OpenMLDB).
zetasql
ZetaSQL - Analyzer Framework for SQL