Oswaldo Fuenmayor's starred repositories
MapReduce-Performance_Testing
MapReduce performance testing using teragen and terasort
terraform-aws-eks
Terraform module to create Amazon Elastic Kubernetes Service (EKS) resources 🇺🇦
terragrunt
Terragrunt is a flexible orchestration tool that allows Infrastructure as Code written in OpenTofu/Terraform to scale.
terraform
Terraform enables you to safely and predictably create, change, and improve infrastructure. It is a source-available tool that codifies APIs into declarative configuration files that can be shared amongst team members, treated as code, edited, reviewed, and versioned.
pyenv-virtualenv
A pyenv plugin to manage virtualenv (a.k.a. python-virtualenv)
docker-airflow
Docker Apache Airflow
Miscellaneous
Includes notes on using Apache Spark in general, notes on using Spark for physics, how to run TPC-DS on PySpark, how to create histograms with Spark, tools for performance testing CPUs, Jupyter notebook examples for Spark, and examples for Oracle and other DB systems.
spark-tpcds-datagen
All the things about TPC-DS in Apache Spark
spark-tpc-ds-performance-test
Use the TPC-DS benchmark to test Spark SQL performance
spark-daria
Essential Spark extensions and helper methods ✨😲
spark-style-guide
Spark style guide
awesome-spark
A curated list of awesome Apache Spark packages and resources.
airflow-maintenance-dags
A series of DAGs/Workflows to help maintain the operation of Airflow
jvm-profiler
JVM Profiler Sending Metrics to Kafka, Console Output or Custom Reporter
Spark-The-Definitive-Guide
Spark: The Definitive Guide's Code Repository