Dezhi Cai's repositories
500lines
500 Lines or Less
AI_Tutorial
精选机器学习,NLP,图像识别, 深度学习等人工智能领域学习资料,搜索,推荐,广告系统架构及算法技术资料整理。算法大牛笔记汇总
Alink
Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.
apisix
The Cloud-Native API Gateway
avro-random-generator
Used to generate mock Avro data
awesome-lowcode
国内低代码平台从业者交流
awesome-risk-control
风控知识总结
dolphinscheduler
Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available out of box.
educative-courses
A collection of courses, scraped from the website Educative (educative.io). Feel free to use!
etl-with-airflow
ETL best practices with airflow, with examples
flink
Apache Flink
flink-cdc-connectors
Change Data Capture (CDC) Connectors for Apache Flink
genie
Distributed Big Data Orchestration Service
grokking-system-design
Grokking system design
hive
Apache Hive
hudi
Upserts, Deletes And Incremental Processing on Big Data.
hudi-demos
汇总Apache Hudi中的一些Demo,便于快速上手Apache Hudi(Apache Hudi Demos to help beginners know about Hudi)
iceberg
Apache Iceberg
incubator-superset
Apache Superset (incubating) is a modern, enterprise-ready business intelligence web application
interview_internal_reference
2020年最新总结,阿里,腾讯,百度,美团,头条等技术面试题目,以及答案,专家出题人分析汇总。
kylin
Apache Kylin
logging-log4j2
Mirror of Apache Logging Log4J2
lucene-solr
Apache Lucene and Solr open-source search software
minibase
An embedded KV storage engine for learning HBase
shiro
Mirror of Apache Shiro
spark
Apache Spark
stateful-functions
Stateful Functions for Apache Flink
system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
tabula
Tabula is a tool for liberating data tables trapped inside PDF files
telegraf
The plugin-driven server agent for collecting & reporting metrics.