Horacio Soldman's repositories
batch-processing-on-aws
With everything I learned from DEZoomcamp from datatalks.club, this project performs a batch processing on AWS for the cycling dataset which is available on Transport For London (TFL) website. https://cycling.data.tfl.gov.uk
export-import-mongodb
This repository contains two scripts to export and import mongodb collections.
realtime-events-analytics
This is an end-to-end data engineering project which allows realtime analytics of a website clicks events.
playground-uehfkxm3
Tech.io playground
A-MERN-Stack-App
This project is a simple fullstack app created with MERN stack
CW2_ECOM
This is a small end-to-end e-commerce project which employs a Recommender system on the backend. The project is part of the Ecommerce module in my Msc Data Science.
data-engineering-zoomcamp
Free Data Engineering course!
data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
delivery_app
Solution
DTC-DE-zoomcamp--Solutions
This repository contains my work for the Datatalksclub Data Engineering Zoomcamp.
etl-with-python
This notebook shows a simple ETL pipeline using Python
mlops-zoomcamp
Free MLOps course from DataTalks.Club
mlopszoomcamp-solutions
This repo contains some of my practical experience while taking the MLOps zoomcamp course from datatalks.club
pyspark-cheatsheet
🐍 Quick reference guide to common patterns & functions in PySpark.