xiuqingyao's starred repositories

Python-100-Days

Python - 100天从新手到大师

Coursera-ML-AndrewNg-Notes

吴恩达老师的机器学习课程个人笔记

pumpkin-book

《机器学习》(西瓜书)公式详解

ddia

《Designing Data-Intensive Application》DDIA中文翻译

Language:PythonLicense:CC-BY-4.0Stargazers:19407Issues:362Issues:71

flink-learning

flink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去重、监控告警)分享。欢迎大家支持我的专栏《大数据实时计算引擎 Flink 实战与性能优化》

Language:JavaLicense:Apache-2.0Stargazers:14319Issues:516Issues:0

xg2xg

by ex-googlers, for ex-googlers - a lookup table of similar tech & services

debezium

Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.

Language:JavaLicense:Apache-2.0Stargazers:10030Issues:209Issues:0

juicefs

JuiceFS is a distributed POSIX file system built on top of Redis and S3.

Language:GoLicense:Apache-2.0Stargazers:9938Issues:112Issues:1284

God-Of-BigData

专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...

datax-web

DataX集成可视化页面,选择数据源即可一键生成数据同步任务,支持RDBMS、Hive、HBase、ClickHouse、MongoDB等数据源,批量创建RDBMS数据同步任务,集成开源调度系统,支持分布式、增量同步数据、实时查看运行日志、监控执行器资源、KILL运行进程、数据源信息加密等。

Language:JavaLicense:MITStargazers:5394Issues:132Issues:514

flink-training-course

Flink 中文视频课程(持续更新...)

maxwell

Maxwell's daemon, a mysql-to-json kafka producer

Language:JavaLicense:NOASSERTIONStargazers:3939Issues:524Issues:1084

mm-wiki

MM-Wiki 一个轻量级的企业知识分享与团队协同软件,可用于快速构建企业 Wiki 和团队知识分享平台。部署方便,使用简单,帮助团队构建一个信息共享、文档管理的协作环境。

linkis

Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.

Language:JavaLicense:Apache-2.0Stargazers:3251Issues:262Issues:2529

pimcore

Core Framework for the Open Source Data & Experience Management Platform (PIM, MDM, CDP, DAM, DXP/CMS & Digital Commerce)

Language:PHPLicense:NOASSERTIONStargazers:3227Issues:180Issues:6620

DataSphereStudio

DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.

Language:JavaLicense:Apache-2.0Stargazers:2984Issues:181Issues:742

awesome-mpc

A curated list of multi party computation resources and links.

Database-Notes

📚深入浅出数据库存储:数据库理论、关系型数据库、文档型数据库、键值型数据库、New SQL、搜索引擎、数据仓库与 OLAP、大数据与数据中台

Language:HTMLLicense:NOASSERTIONStargazers:1078Issues:39Issues:0

cdhproject

hadoop各组件使用,持续更新

xiaohouzi

小猴子最新后台网站 www.xiaohouzilaaa.site 小猴子安卓版https://raw.githubusercontent.com/xiaohouzivpn/xiaohouzi/master/xiaohouzijiasuqi.apk 小猴子 pc版本 https://raw.githubusercontent.com/xiaohouzivpn/xiaohouzi/master/xiaohouzipc.rar

spark-bench

Benchmark Suite for Apache Spark

Language:ScalaLicense:Apache-2.0Stargazers:238Issues:34Issues:129

azkaban_assistant

azkaban小助手,增加任务web配置、远程脚本调用、报警扩展、跨项目依赖等功能。

Language:PythonLicense:Apache-2.0Stargazers:118Issues:12Issues:16

DDW

分布式数据仓库最佳实践

SpringBoot-Learn

Learning about the spring-boot instances

cdh-autouninstall

基于CDH5.x parcles安装,一键卸载脚本

logminer-kafka-connect

CDC Kafka Connect source for Oracle Databases leveraging Oracle Logminer

Language:KotlinLicense:Apache-2.0Stargazers:30Issues:9Issues:16

idcard-resolution

身份证号解析的小工具,可以根据输入的身份证号输出,此身份证号的省份,城市,区县,性别,出生日期

Language:JavaStargazers:25Issues:2Issues:0

java_rsa_poc

Generate RSA public and private KEY in two format and crypt and decrypt text

Language:JavaStargazers:1Issues:0Issues:0