Dezhi Cai's repositories
spark
Apache Spark
flink
Apache Flink
AI_Tutorial
精选机器学习,NLP,图像识别, 深度学习等人工智能领域学习资料,搜索,推荐,广告系统架构及算法技术资料整理。算法大牛笔记汇总
hudi
Upserts, Deletes And Incremental Processing on Big Data.
flink-cdc-connectors
Change Data Capture (CDC) Connectors for Apache Flink
awesome-lowcode
国内低代码平台从业者交流
apisix
The Cloud-Native API Gateway
dolphinscheduler
Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available out of box.
kylin
Apache Kylin
avro-random-generator
Used to generate mock Avro data
genie
Distributed Big Data Orchestration Service
iceberg
Apache Iceberg
interview_internal_reference
2020年最新总结,阿里,腾讯,百度,美团,头条等技术面试题目,以及答案,专家出题人分析汇总。
etl-with-airflow
ETL best practices with airflow, with examples
minibase
An embedded KV storage engine for learning HBase
hudi-demos
汇总Apache Hudi中的一些Demo,便于快速上手Apache Hudi(Apache Hudi Demos to help beginners know about Hudi)
system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
educative-courses
A collection of courses, scraped from the website Educative (educative.io). Feel free to use!
500lines
500 Lines or Less
grokking-system-design
Grokking system design
awesome-risk-control
风控知识总结
telegraf
The plugin-driven server agent for collecting & reporting metrics.
incubator-superset
Apache Superset (incubating) is a modern, enterprise-ready business intelligence web application
lucene-solr
Apache Lucene and Solr open-source search software
logging-log4j2
Mirror of Apache Logging Log4J2
stateful-functions
Stateful Functions for Apache Flink
Alink
Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.
hive
Apache Hive
shiro
Mirror of Apache Shiro
tabula
Tabula is a tool for liberating data tables trapped inside PDF files