Elie Kawerk's repositories
data-engineering-zoomcamp
Code for Data Engineer Zoomcamp course
business-science-lucaso21.github.io
Repo for code and resources for a personal website
causal_debiased_ranking
We will show how to factorize and debias ranking to improve personalization, reducing both popularity bias on the item side and the dominance of power users.
data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
DataEngineer-io-bootcamp3-homework-submission-anjala.lahan
DataEngineer-io bootcamp3 homework submission <Combined Track> <Submitted by Anjala Abdul Rehman>
dataengineerio-capstone-ryanbrown
Capstone project for Dataengineer.io bootcamp (public repo)
ebnerd-benchmark
Ekstra Bladet Recommender System repository for benchmarking the EBNeRD dataset.
forecasting-with-machine-learning
Code repository for the course "Forecasting with Machine Learning Models"
kd_early_ranker
Covers a couple of approaches to training an early ranker with knowledge distillation from the final ranker
Matteo-Courthoud-Blog-Posts
Code and notebooks for my Medium blog posts
ml-engineering
Machine Learning Engineering Open Book
ml.school
Machine Learning School
mlops-python-package
Kickstart your MLOps initiative with a flexible, robust, and productive Python package.
mlops-zoomcamp
Free MLOps course from DataTalks.Club
practical-data-engineering
Real estate dagster pipeline
reddit-dataengineerio
Repo holding the Reddit ETL process
reservation_cancellation_prediction
Predict if a reservation will be canceled using robust Machine Learning pipelines with Airflow and MLflow
reward_maximizing_ranking
Adding REINFORCE based reward maximization to pointwise ranking
serverless-ml-course
Serverless Machine Learning Course for building AI-enabled Prediction Services from models and features
sml-project-2023-manfredi-meneghin
Scalable Machine Learning and Deep Learning, Final Project, 2023/2024
spotify-stream-analytics
Generate a synthetic Spotify music stream dataset to create dashboards. Spotify API generates fake event data emitted to Kafka. Spark consumes and processes Kafka data, saving it to the data lake. Airflow orchestrates the pipeline. dbt moves data to Snowflake, transforms it, and creates dashboards.
viberary
Good books, good vibes
wave_height_prediction_huntington
Wave height prediction for Huntington Beach, California, USA.
ZachV3Bootcamp
Repository