cqboywy's repositories
dlink
Dlink & Apache Flink
DataLink
DataLink is a new open-source solution that brings Flink development to the data center.
flink-tutorial
Flink demo
flink-learning
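Flink learning blog. http://www.flink-learning.com Covers Flink basics, concepts, principles, hands-on practice, performance tuning, and source code analysis. Includes learning examples for Flink Connector, Metrics, Library, DataStream API, and Table API & SQL, plus case studies of large-scale Flink production projects (PV/UV, log storage, real-time deduplication of tens of billions of records, monitoring and alerting). Support for my column 《大数据实时计算引擎 Flink 实战与性能优化》 ("Big Data Real-Time Compute Engine: Flink in Practice and Performance Optimization") is welcome.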
DataSphereStudio
DSS covers scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, task scheduling and data exporting.
Play-with-Machine-Learning-Algorithms
Code for my MOOC course "Play with Machine Learning Algorithms" (the imooc course 《Python3 入门机器学习》, "Introduction to Machine Learning with Python 3"). Updated course content and supplementary exercises will also be added to this repository over time.
DataQuality
Data governance -> data quality
pyspark-example-project
Example project implementing best practices for PySpark ETL jobs and applications.
Kaggle-Titanic
Kaggle kernel for the Titanic dataset
cdhproject
Usage of the various Hadoop components; continuously updated