Mario Renau's repositories
spark-playground
Code snippets used in demos recorded for the blog.
efficient_data_processing_spark_fork
Code for "Efficient Data Processing in Spark" Course
incubator-pekko-samples_fork
Apache Pekko Sample Projects
data-algorithms-with-spark
O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
acid-file-formats
Code for Apache Hudi, Apache Iceberg and Delta Lake analysis
data-product-streaming
data-product-streaming
CursoIntroPython
Curso de introducción a la programación con python para Launch X de Innovacción Virtual
spark-daria
Essential Spark extensions and helper methods ✨😲
etl-with-airflow
ETL best practices with airflow, with examples
akka-cassandra-demo
The repository for the demonstration of Akka & Cassandra integration
datamesh
Material for the DataMesh presentation at GoDataFest 2021
a-kafka-story
Kafka ecosystem ... but step by step!
code
Example application code for the python architecture book
talos
Lawful circuit breakers for Scala. Akka and monix circuit breaker implementations with monitoring.
kubeflow-spark
Orchestrate Spark Jobs from Kubeflow Pipelines and poll for the status.
machine-learning-engineering-for-production-public
Public repo for DeepLearning.AI MLEP Specialization
nessie-demos
Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.
presto-workload-analyzer
The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them
ml-deployment
Repo for post
bigdata_stack
Dockerized Hadoop/Minio/Hive/Presto stack
awesome-data-engineering
A curated list of data engineering tools for software developers
OpenMetadata
Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.