There are 1 repository under etl-components topic.
One ETL tool to rule them all
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
A framework for moving data into a data warehouse.
Source code and test material for developing ETL components for use in SD2E
Singer (ETL) Pipedrive playground (with Redash (Data Visualization))
Phone-Matchup a Phone Prediction Model which uses ETL Pipeline for data extraction.
Extract, Transformation & Load analytical worflow for INEGI data for defunciones, year 2012.
Northwind OLTP ETL Package using SSIS
Customisable ETL utility to validate, filter and merge CSV files. Off-the-shelf merges files from Google COVID-19 repository while checking the input data for errors, inconsistencies etc.
Fraud detection on mobile banking transactions
Project uses Pandas to create multiple DataFrames from CSV files containing Disneyland Reviews and Chocolate Reviews.. Cleaned those DataFrames, then loaded to PostgreSQL to create a relational database to join everything together.
Import data from GitLab to PostgreSQL with singer tap-gitlab