Oswaldo Fuenmayor's repositories
docker-airflow
Docker Apache Airflow
amazon-sagemaker-architecting-for-ml
Materials for a 3-day instructor led course on applying machine learning
dr-elephant
Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark
sparklint
A tool for monitoring and tuning Spark jobs for efficiency.
spark-scala-wordcount2
Wordcount example using Spark with Scala
spark-testing-base
Base classes to use when writing tests with Spark
genie
Distributed Big Data Orchestration Service
spark
Mirror of Apache Spark
spark-bench
Benchmark Suite for Apache Spark
mastering-apache-spark-book
Mastering Apache Spark 2
Head-First-Design-Patterns
Code for Head First Design Patterns book (2014)
sqoop-on-spark
Sqoop on Apache Spark Engine
learning-spark
Example code from Learning Spark book
spark-sql-on-hbase-cdh
Add new branch for compatibility with CDH based on this repo: https://github.com/Huawei-Spark/Spark-SQL-on-HBase
Spark-SQL-on-HBase
Native, optimized access to HBase Data through Spark SQL/Dataframe Interfaces