Jairo Souza's starred repositories
findpapers
Findpapers: A tool for helping researchers who are looking for related works
safaribooks
Download and generate EPUB of your favorite books from O'Reilly Learning (aka Safari Books Online) library.
learning-notes
Notes on books I read, talks I watch, articles I study, and papers I love
system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
astronomer-cosmos
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
awesome-readme
A curated list of awesome READMEs
data-engineering-zoomcamp
Free Data Engineering course!
ligar-cobranca
Automatically call debt collection companies and leave a voice saying "Hello?" over and over.
repodriller
A tool to support researchers in mining software repositories (MSR) studies
awesome-apache-airflow
Curated list of resources about Apache Airflow
airflow-testing-ci-workflow
(Project & tutorial) DAG pipeline tests + CI/CD setup
Data-Pipelines-with-Airflow
This project helped me understand the core concepts of Apache Airflow. It automates an ETL pipeline and the creation of a data warehouse using Airflow, Python, and Amazon Redshift. I wrote custom operators to stage the data, fill the data warehouse, and run data quality checks as the final validation step. Data from various sources is transformed into a star schema optimized for the analytics team’s use cases. Technologies used: Apache Airflow, S3, Amazon Redshift, Python.
airflow-pentaho-plugin
Pentaho plugin for Apache Airflow - Orchestrate Pentaho transformations and jobs from Airflow
awesome-production-machine-learning
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
awesome-mlops
A curated list of references for MLOps
form-to-google-sheets
Store HTML form submissions in Google Sheets.
awesome-seml
A curated list of articles that cover the software engineering best practices for building machine learning applications.
pentaho-pdi-dataset
Set of PDI plugins to more easily work with data sets. We also want to provide unit testing capabilities through input data sets and golden data sets.