Eugene's repositories
bigdata-file-viewer
A cross-platform (Windows, MAC, Linux) desktop application to view common bigdata binary format like Parquet, ORC, AVRO, etc. Support local file system, HDFS, AWS S3, Azure Blob Storage ,etc.
branch-predictor-demo
It's a Desktop application to demo the branch prediction.
pexels-photos-dumper
A crawler to get free high quality photos from pexels.com.
arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.
aws-doc-sdk-examples
Welcome to the AWS Code Examples Repository. This repo contains code examples used in the AWS documentation, AWS SDK Developer Guides, and more. For more information, see the Readme.rst file below.
codebytere.github.io
personal website
FileStation
Temparary usage for file sharing.
learning-spark
Example code from Learning Spark book
pmem-shuffle
Spark* Shuffle plugin for support shuffling through remote persistent memory over fabrics, which leverages the RDMA network and remote persistent memory (for read) to provide extremely high performance and low latency shuffle solutions for Spark*.
redis-plus-plus
Redis client written in C++
scala-maven-scaffold
A quick start scaffold for a Scala Project using maven as build system
spark-flowchart
Flowchart for debugging Spark applications
Spark-PMoF
Spark Shuffle Optimization with RDMA+AEP
SparkInternals
Notes talking about the design and implementation of Apache Spark