Patrick Woody's repositories
arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.
atlasdb
Transactional Distributed Database Layer
gradle-spark
A quick template project for making Spark applications
grpc-java
The Java gRPC implementation. HTTP/2 based RPC
hadoop-crypto
Library for per-file client-side encyption in Hadoop FileSystems such as HDFS or S3.
kafka-docker
Dockerfile for Apache Kafka
parquet-mr
Mirror of Apache Parquet
resource-identifier
Common resource identifier specification for inter-application object sharing
spark
Mirror of Apache Spark
spark-xml
XML data source for Spark SQL and DataFrames