Shixiong Zhu's repositories
arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.
connectors
Connectors for Delta Lake
couchbase-spark-connector
The Official Couchbase Spark Connector
delta-sharing-1
An open protocol for secure data sharing
dstream-akka
Akka data source for DStream (Spark Streaming)
dstream-flume
Flume data source for DStream (Spark Streaming)
dstream-mqtt
MQTT data source for DStream (Spark Streaming)
dstream-zeromq
ZeroMQ data source for DStream (Spark Streaming)
make-release-notes
Generates release notes for Scala.
reactivex.github.io
ReactiveX Website
scala-style-guide
Databricks Scala Coding Style Guide
spark-perf
Performance tests for Spark