wxmimperio's repositories
elasticsearch-snippets
Elasticsearch code snippets and client ops for v1.7.1 and v6.2.4
flume-multiple-taildir-source
Flume taildirsource source code extension, supports dynamic subdirectory recognition and regular filtering.
java-demos-snippets
Record daily java code use cases
bigdata-code-snippets
Record daily bigdata code use cases(hadoop、spark、hbase...)
scala-getting-started
Learning scala
api-gateway
《API网关:中间件设计和实践》—— 微服务设计,源码级体验!
BigDataGuide
大数据学习,从零开始学习大数据,包含大数据学习各阶段学习视频、面试资料
CloudShuffleService
Cloud Shuffle Service(CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce.
deeplearning_ai_books
deeplearning.ai(吴恩达老师的深度学习课程笔记及资源)
flink-best-practice
flink code
geobuf-java
geobuf in java - what more can I say?
geotrellis
GeoTrellis is a geographic data processing engine for high performance applications.
git-tips
:trollface:Git的奇技淫巧
imperio-wxm.github.com
My pages blog.
jdonframework
Domain-Driven-Design Pub/Sub Domain-Events framework
kafka-streams-best-practice
kafka-streams
Miscellaneous
Includes notes on Apache Spark, Spark for Physics, Jupyter notebook examples for Spark, Oracle and other DB systems.
recipes
The Immerok Apache Flink Cookbook is a collection of examples of Apache Flink applications in the format of "recipes". Each recipe explains how you can solve a specific problem by leveraging one or more of the APIs of Apache Flink. The recipes can be extended or provide a basis for solving your requirements with Apache Flink.
spark-standalone-cluster-on-docker
Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker. :zap:
sparkMeasure
This is the development repository for sparkMeasure, a tool for performance troubleshooting of Apache Spark workloads. It simplifies the collection and analysis of Spark task and stage metrics data.
SZT-bigdata
深圳地铁大数据客流分析系统🚇🚄🌟
vertx-jdbc-client
JDBC support for Vert.x