Jerry Shao (jerryshao)

Location: Sunnyvale

Jerry Shao's repositories

spark-kafka-0-8-sql

Spark Structured Streaming Kafka 0.8 Source Implementation

Language: Scala | License: Apache-2.0 | Stargazers: 35 | Issues: 9 | Issues: 6

spark-hive-streaming-sink

A sink to save a Spark Structured Streaming DataFrame into a Hive table

streaming-demo

A Spark Streaming demo framework that implements and improves the functions of Twitter Rainbird

Language: Scala | Stargazers: 15 | Issues: 7 | Issues: 0

jerryshao.github.com

My Jekyll web page, forked from Jekyll

Language: HTML | License: Apache-2.0 | Stargazers: 8 | Issues: 2 | Issues: 0

spark2-ambari-definition

Ambari definition to install Spark 2.0

spark-atlas-connector

A Spark Atlas connector to track data lineage in Apache Atlas

Language: Scala | License: Apache-2.0 | Stargazers: 6 | Issues: 2 | Issues: 0

spark-streaming-kafka-0-10-connector

A Kafka 0.10 connector for Spark 1.x Streaming

Language: Scala | License: Apache-2.0 | Stargazers: 5 | Issues: 4 | Issues: 2

spark-website

Mirror of Apache Spark Website

Language: HTML | License: Apache-2.0 | Stargazers: 2 | Issues: 2 | Issues: 0

awesome-bigdata

A curated list of awesome big data frameworks, resources and other awesomeness.

License: MIT | Stargazers: 1 | Issues: 2 | Issues: 0

gravitino

A high-performance, geo-distributed and federated metadata lake

Language: Java | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 2

hudi

Upserts, Deletes And Incremental Processing on Big Data.

Language: Java | License: Apache-2.0 | Stargazers: 1 | Issues: 1 | Issues: 0

incubator-iceberg

Apache Iceberg (Incubating)

Language: Java | License: Apache-2.0 | Stargazers: 1 | Issues: 1 | Issues: 0

incubator-livy

Mirror of Apache Livy (Incubating)

Language: Scala | License: Apache-2.0 | Stargazers: 1 | Issues: 2 | Issues: 0

incubator-spark

Mirror of Apache Spark

Language: Scala | License: Apache-2.0 | Stargazers: 1 | Issues: 2 | Issues: 0

incubator-uniffle

Uniffle is a high-performance, general-purpose Remote Shuffle Service.

Language: Java | License: Apache-2.0 | Stargazers: 1 | Issues: 1 | Issues: 0

kafka-input-format

A Kafka input format used in Hadoop or Spark for batch reading data from Kafka

Language: Scala | License: Apache-2.0 | Stargazers: 1 | Issues: 5 | Issues: 0

spark

Scala framework for iterative and interactive cluster computing.

Language: Scala | License: Apache-2.0 | Stargazers: 1 | Issues: 4 | Issues: 0

spark-terasort

Spark Terasort

Language: Java | License: Apache-2.0 | Stargazers: 1 | Issues: 3 | Issues: 0

apache-spark

Mirror of Apache Spark

Language: Scala | License: Apache-2.0 | Stargazers: 0 | Issues: 3 | Issues: 0
Language: PHP | Stargazers: 0 | Issues: 2 | Issues: 0

gravitino-playground

A playground to experience Gravitino

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

HiBench

HiBench is a big data benchmark suite.

Language: Java | License: NOASSERTION | Stargazers: 0 | Issues: 2 | Issues: 0

hive

Mirror of Apache Hive

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 2 | Issues: 0

incubator-livy-website

Mirror of Apache Livy (Incubating)

Language: CSS | License: Apache-2.0 | Stargazers: 0 | Issues: 2 | Issues: 0

livy

Livy is an open-source REST interface for interacting with Apache Spark from anywhere

Language: Scala | Stargazers: 0 | Issues: 3 | Issues: 0

Mastering-Machine-Learning-with-Spark

Errata for Mastering Machine Learning with Spark

Stargazers: 0 | Issues: 2 | Issues: 1
Language: Scala | License: Apache-2.0 | Stargazers: 0 | Issues: 2 | Issues: 0

storm-test-framework

Storm performance test framework

Language: Java | Stargazers: 0 | Issues: 2 | Issues: 0

tensorflow

Computation using data flow graphs for scalable machine learning

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 2 | Issues: 0

zeppelin

Mirror of Apache Zeppelin

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 2 | Issues: 0