jacopo gobbi's repositories
hay_checker
Data quality metrics in a distributed (spark) or centralized fashion.
approximated_personalized_pagerank
c++11 implementation of approximated personalized pagerank algorithms.
jobs_scraping
Scrape, organize and present job offers and companies. Dockerized and good to go with a docker-compose up.
airflow_hyperparameters_search
Quick setup for a dockerized and scalable hyperparameter search for ML models using airflow.
anomaly-detection-project
data mining project about anomaly detection of time series
argo-workflows
Workflow engine for Kubernetes
backprop_vs_bio
Backpropagation vs bio-inspired for tiny networks
dockerized_spark_cluster_notebook
Interfacing a spark cluster through a python notebook
Count-min-sketch
c++11 Count-min sketch implementation
curriculum
Curriculum and some certifications
enterprise_gateway
A lightweight, multi-tenant, scalable and secure gateway that enables Jupyter Notebooks to share resources across distributed clusters such as Apache Spark, Kubernetes and others.
js-cellular-automata
Drawing cellular automatas with js.
kendall
Header only implementation of the algorithm from "A Computer Method for Calculating Kendall's Tau with Ungrouped Data " by William R. Knight.
orchest
Build data pipelines, the easy way 🛠️
pg_jsonschema
Fork to play around with extensions
semantic-loss-pytorch
PyPSDD porting to Python 3 + PyTorch equivalent tree construction.
unitn_cv_2018_project
Hasty implementation of a top view people tracker for the computer vision course in unitn.