Aaron Wang's repositories
Online_examSystem
SpringBoot+Vue 在线考试系统
Spark-Proxy
push-based calculation for spark application
chatgpt-java
ChatGPT SDK and CLI for Java
2021-Data-Intensive-Computing-PJ
Final project for 2021 fall Theory and practice of data-intensive computing
alluxio
Alluxio, data orchestration for analytics and machine learning in the cloud
dolphinscheduler
Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available out of box.
fluid
Fluid, elastic data abstraction and acceleration for BigData/AI applications in cloud. (Project under CNCF)
LSTM-based-log-level-prediction
Implementation of log level prediction.
optimal-parallel-fp-growth
Final project for 2022 spring BigData Management
github-readme-stats
:zap: Dynamically generated stats for your github readmes
incubator-celeborn
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
incubator-hugegraph-ai
The integration of HugeGraph with artificial intelligence
incubator-hugegraph-computer
HugeGraph Computer - A distributed graph processing system for hugegraph (OLAP)
incubator-hugegraph-doc
HugeGraph Website and Doc
incubator-kyuubi
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
incubator-paimon
Apache Paimon(incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking and efficient real-time analytics.
incubator-seatunnel
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
juicefs
JuiceFS is a distributed POSIX file system built on top of Redis and S3.
kruise
Automated management of large-scale applications on Kubernetes (incubating project under CNCF)
Lihang-Code-SE
李航第二版 《统计学习方法》算法代码实现
personal-tools
Some convenient scripts&tools for personal use
spark
Apache Spark - A unified analytics engine for large-scale data processing
thrift-generalize
泛化Thrift接口,增加服务不用修改客户端代码的方案
towhee
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
tugraph-analytics
TuGraph-analytics is a distribute streaming graph computing engine.