ddkongbb

ddkongbb

Geek Repo

Github PK Tool:Github PK Tool

ddkongbb's starred repositories

alluxio

Alluxio, data orchestration for analytics and machine learning in the cloud

Language:JavaLicense:Apache-2.0Stargazers:6750Issues:0Issues:0

jpmml-model

Java Class Model API for PMML

Language:JavaLicense:BSD-3-ClauseStargazers:150Issues:0Issues:0

shifu-spark

An Alternative Spark Implementation of Shifu 'Eval' Step

Language:ScalaStargazers:1Issues:0Issues:0

shifu

An end-to-end machine learning and data mining framework on Hadoop

Language:JavaLicense:Apache-2.0Stargazers:249Issues:0Issues:0

Kylin

See Apache Kylin Website for a complete description

Language:RoffStargazers:30Issues:0Issues:0

ssb-kylin

Star Schema Benchmark Tool for Apache Kylin

Language:CLicense:Apache-2.0Stargazers:96Issues:0Issues:0

kafka-connect-elastic-sink

Kafka connect Elastic sink connector, with just in time index/delete behaviour.

Language:JavaStargazers:26Issues:0Issues:0

kafka-storm-starter

[PROJECT IS NO LONGER MAINTAINED] Code examples that show to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.

Language:ScalaLicense:NOASSERTIONStargazers:726Issues:0Issues:0

bijection

Reversible conversions between types

Language:ScalaLicense:Apache-2.0Stargazers:657Issues:0Issues:0
Language:JavaLicense:MITStargazers:48Issues:0Issues:0

kafka-tools

Collection of scripts for working with Kafka

Language:PythonStargazers:6Issues:0Issues:0

kafka-examples

Snippets and small examples demonstrating kafka features and configs

Language:JavaLicense:Apache-2.0Stargazers:638Issues:0Issues:0

BinlogAnalysis

解析Mysql binlog日志并发至Kafka

Language:JavaStargazers:23Issues:0Issues:0

kafka-connector-mysql

Kafka connector for MySQL

Language:JavaStargazers:9Issues:0Issues:0

otter

阿里巴巴分布式数据库同步系统(解决中美异地机房)

Language:JavaLicense:Apache-2.0Stargazers:8021Issues:0Issues:0

gobblin

A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.

Language:JavaLicense:Apache-2.0Stargazers:2206Issues:0Issues:0

canal

阿里巴巴 MySQL binlog 增量订阅&消费组件

Language:JavaLicense:Apache-2.0Stargazers:28164Issues:0Issues:0

shc

The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink.

Language:ScalaLicense:Apache-2.0Stargazers:552Issues:0Issues:0

spark-summit-2017-SanFrancisco

spark summit 2017 SanFrancisco

Stargazers:98Issues:0Issues:0

aerospike-hadoop

Aerospike Hadoop Connector

Language:JavaLicense:Apache-2.0Stargazers:18Issues:0Issues:0

DeepDriver

DeepDriver is a JAVA framework of Deep Learning, it supports ANN/CNN/DNN/RNN/LSTM now, hope it can be widely used for deep learning development.

Language:JavaLicense:Apache-2.0Stargazers:98Issues:0Issues:0

debezium

Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.

Language:JavaLicense:Apache-2.0Stargazers:10236Issues:0Issues:0
Language:JavaLicense:Apache-2.0Stargazers:39Issues:0Issues:0

drillworkshop

Repository for the Apache Drill Workshop

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:20Issues:0Issues:0

presto_legacy

Distributed SQL query engine for big data

Language:JavaLicense:Apache-2.0Stargazers:9Issues:0Issues:0

PyHive

Python interface to Hive and Presto. 🐝

Language:PythonLicense:NOASSERTIONStargazers:1666Issues:0Issues:0

presto-python-client

Python DB-API client for Presto

Language:PythonLicense:Apache-2.0Stargazers:236Issues:0Issues:0

pydata-book

Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:21763Issues:0Issues:0

spark-hbase-connector

Connect Spark to HBase for reading and writing data with ease

Language:ScalaLicense:Apache-2.0Stargazers:298Issues:0Issues:0

presto

Teradata Distribution of Presto -- A Distributed SQL Query Engine for Big Data

Language:JavaLicense:Apache-2.0Stargazers:94Issues:0Issues:0