kowalskyyy999 / POC-Lakehouse

Proof Of Concept implementation of data lakehouse. Using Spark as analytical engine, Delta lake as storage layer, MLFlow as Machine Learning model tracking. Titanic dataset as sample data for ingestion and create a model to predict a passenger survived

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

This repository is not active

About

Proof Of Concept implementation of data lakehouse. Using Spark as analytical engine, Delta lake as storage layer, MLFlow as Machine Learning model tracking. Titanic dataset as sample data for ingestion and create a model to predict a passenger survived


Languages

Language:Python 94.1%Language:Shell 5.9%