MAX.H's repositories

awesome-etl

A curated list of awesome ETL frameworks, libraries, and software.

Stargazers:0Issues:0Issues:0

awesome-python

A curated list of awesome Python frameworks, libraries, software and resources

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

big-data-landscape

Big Data Landscape (by www.qaware.de)

License:GPL-3.0Stargazers:0Issues:1Issues:0

big-data-plugin

Kettle plugin that provides support for interacting within many "big data" projects including Hadoop, Hive, HBase, Cassandra, MongoDB, and others.

Language:JavaLicense:Apache-2.0Stargazers:0Issues:2Issues:0

bigdata

big data technology

Language:Jupyter NotebookStargazers:0Issues:1Issues:0

cubes

Light-weight Python OLAP framework for multi-dimensional data analysis

Language:PythonLicense:NOASSERTIONStargazers:0Issues:3Issues:0

datacollector

StreamSets Data Collector - Continuous big data and cloud platform ingest infrastructure

Language:JavaLicense:Apache-2.0Stargazers:0Issues:1Issues:0

DataQuality

DataQuality for BigData

Language:ScalaLicense:LGPL-3.0Stargazers:0Issues:3Issues:0

debezium

Change data capture for a variety of databases. https://debezium.io Please log issues in our JIRA at https://issues.jboss.org/projects/DBZ/issues

License:Apache-2.0Stargazers:0Issues:0Issues:0

FATE

An Industrial Level Federated Learning Framework

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

flink-learning

flink learning blog. http://www.54tianzhisheng.cn 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去重、监控告警)分享。欢迎大家支持我的专栏《大数据实时计算引擎 Flink 实战与性能优化》

License:Apache-2.0Stargazers:0Issues:0Issues:0

gpdb

Greenplum Database

Language:CLicense:Apache-2.0Stargazers:0Issues:1Issues:0

gradict-charts-doc

图之典 - 图表文档

License:MITStargazers:0Issues:0Issues:0

hadoopecosystemtable.github.io

This page is a summary to keep the track of Hadoop related projects, and relevant projects around Big Data scene focused on the open source, free software environment.

Language:HTMLLicense:Apache-2.0Stargazers:0Issues:2Issues:0

Hystrix

Hystrix is a latency and fault tolerance library designed to isolate points of access to remote systems, services and 3rd party libraries, stop cascading failure and enable resilience in complex distributed systems where failure is inevitable.

Language:JavaStargazers:0Issues:1Issues:0

ignite-learning-paths-training

Ignite Learning Path Training

License:CC-BY-4.0Stargazers:0Issues:2Issues:0

incubator-hop

Hop Orchestration Platform

License:Apache-2.0Stargazers:0Issues:0Issues:0

incubator-hudi

Upserts And Incremental Processing on Big Data

Language:JavaLicense:Apache-2.0Stargazers:0Issues:1Issues:0

incubator-superset

Apache Superset (incubating) is a modern, enterprise-ready business intelligence web application

Language:JavaScriptLicense:Apache-2.0Stargazers:0Issues:3Issues:0

Java

All Algorithms implemented in Java

Language:JavaStargazers:0Issues:0Issues:0

kettle-scheduler

一款简单易用的Kettle调度监控平台,专门用来调度和监控由kettle客户端创建的job和transformation。整体的框架是由spring+sprin gmvc +beetlsql整合而成,通过调用kettle的API来执行转换和作业,并且使用quartz框架完成调度工作。

License:Apache-2.0Stargazers:0Issues:0Issues:0

kudu

Mirror of Apache Kudu

License:Apache-2.0Stargazers:0Issues:0Issues:0

Machine-Learning-Study-Path-March-2019

A complete ML study path, focused on TensorFlow and Scikit-Learn

Stargazers:0Issues:0Issues:0

openGauss-server

openGauss kernel

License:NOASSERTIONStargazers:0Issues:0Issues:0

pentaho-kettle

Pentaho Data Integration ( ETL ) a.k.a Kettle

Language:JavaLicense:Apache-2.0Stargazers:0Issues:1Issues:0

presto-1

Official home of Presto, the distributed SQL query engine for big data

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

redash

Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.

Language:PythonLicense:BSD-2-ClauseStargazers:0Issues:3Issues:0

spark-clickhouse

spark to yandex clickhouse connector

Language:ScalaLicense:NOASSERTIONStargazers:0Issues:2Issues:0

spring-cloud-dataflow

Spring Cloud Data Flow is a toolkit for building data integration and real-time data processing pipelines.

Language:JavaLicense:Apache-2.0Stargazers:0Issues:3Issues:0

waterdrop

生产环境的海量数据计算产品,文档地址:

License:Apache-2.0Stargazers:0Issues:0Issues:0