lzm7455's repositories
pyod
A Python Toolbox for Scalable Outlier Detection (Anomaly Detection)
featuretools
An open source python framework for automated feature engineering
pwc
Papers with code. Sorted by stars. Updated weekly.
Cpp-Primer
C++ Primer 5 answers
XX-Net
A web proxy tool
palo
Palo, the MPP data warehouse
atlas
Mirror of Apache Atlas
CBoard
An easy to use, self-service open BI reporting and BI dashboard platform.
incubator-gobblin
Gobblin is a distributed big data integration framework (ingestion, replication, compliance, retention) for batch and streaming systems. Gobblin features integrations with Apache Hadoop, Apache Kafka, Salesforce, S3, MySQL, Google etc.
PLMCodeTemplate
Code framework template created for the department
geo
S2 geometry library in Go
hadoop-lzo
Refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20
mastering-apache-spark-book
Mastering Apache Spark 2
metrics
Capturing JVM- and application-level metrics. So you know what's going on.
Apache-Flink-Docs-ZH-translation
Chinese translation project for the official Apache Flink documentation
spark-workshop
Materials (slides and code) for Spark Workshops
cm_api
Cloudera Manager API Client
KAP-manual
Kyligence Analytics Platform Manual
falcon
Mirror of Apache Falcon
geohash-java
Implementation of GeoHashes in Java. We try to stay compliant with the spec as far as possible.
spark
Mirror of Apache Spark
word_cloud
A little word cloud generator in Python
incubator-atlas
Mirror of Apache Atlas (Incubating)
hbase
Mirror of Apache HBase
pinot
A realtime distributed OLAP datastore
go
The Go programming language
tidb
TiDB is a distributed NewSQL database compatible with MySQL protocol
tikv
Distributed transactional key value database powered by Rust and Raft
indexr
An open-source columnar data format designed for fast, realtime analytics on big data.