Mario Renau's starred repositories
project-based-learning
Curated list of project-based tutorials
OpenSearch
🔎 Open source distributed and RESTful search engine.
postgresml
The GPU-powered AI application database. Get your app to market faster using the simplicity of SQL and the latest NLP, ML + LLM models.
etl-with-airflow
ETL best practices with Airflow, with examples
xlskubectl
xlskubectl — a spreadsheet to control your Kubernetes cluster
Daily-Dose-of-Data-Science
A collection of code snippets from the publication Daily Dose of Data Science on Substack: http://www.dailydoseofds.com/
testcontainers-scala
Docker containers for testing in Scala
automate-dv
A free-to-use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineering tool, registered trademark of dbt Labs)
ActivitySchema
Repository for the ActivitySchema spec and supporting materials
kafka-delta-ingest
A highly efficient daemon for streaming data from Kafka into Delta Lake
gtfs-validator
Canonical GTFS Validator project for schedule (static) files.
CursoIntroPython
Introductory course on programming with Python for Launch X by Innovacción Virtual
awesome-dataops
😎 A curated list of awesome DataOps tools
spark-sql-flow-plugin
Visualize column-level data lineage in Spark SQL
analytical_dp_with_sql
Code for my "Efficient Data Processing in SQL" book.
Scala-Category-Theory
Bartosz Milewski's great book on category theory, implemented in Scala with property tests
scalacrashcourse
Crash course in Scala
hitchhikers_guide_to_deltalake_streaming
Don't Panic. This guide will help you when it feels like the end of the world.
trino-plugins
Simplified custom plugins for Trino
dbt-data-ai-summit
Code that was used as an example during the Data+AI Summit 2020