Elie Kawerk's repositories
data-engineering-zoomcamp
Code for Data Engineer Zoomcamp course
7-llm-driven-data-engineering
This is a public repository to go over all the LLM-driven data engineering concepts.
AuthoritySite
Advice on authority websites.
awesome-pipeline
A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin
business-science-lucaso21.github.io
Reop for code and resources for personal website
causalML-teaching
This repository consolidates my teaching material for "Causal Machine Learning".
DS4B_Python_ML_And_APIs
Repo for code and resources related Business Science Python ML & APIs course
Empirical-Bayes
Empirical Bayes Mixtape Session taught by Christopher Walters
hands-on-llms
Learn how to engineer your end-to-end LLM ecosystem: training, streaming, and inference pipelines | deploy & automate | work in progress...
kd_early_ranker
Covers a couple of approaches to training an early ranker with knowledge distillation from final ranker
llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
looper-causality-stats-physics
A resource list for causality in statistics, data science and physics
Machine-Learning-Mixtape
Machine Learning and Causal Inference taught by Brigham Frandsen
Matteo-Courthoud-Blog-Posts
Code and notebooks for my Medium blog posts
Mixape-Causal-Inference-2
Causal Inference II Mixtape Session taught by Scott Cunningham
Mixtape-Causal-Inference-1
Causal Inference 1 Mixtape Session taught by Scott Cunningham
Mixtape-Shift-Share
Shift-Share Instrument Mixtape Track taught by Peter Hull
pipelined_early_ranker
Pipelined early ranker in a recommender system
recsys-llm-chatbot
A LLM based chatbot Recommender engine
reservation_cancellation_prediction
Predict if a reservation will be canceled using robust Machine Learning pipelines with Airflow and Mlflow
retail_analytics
Repo for code and resources related to a retail analytics project.
reward_maximizing_ranking
Adding REINFORCE based reward maximization to pointwise ranking
serverless-ml-course
Serverless Machine Learning Course for building AI-enabled Prediction Services from models and features
spotify-stream-analytics
Generate synthetic Spotify music stream dataset to create dashboards. Spotify API generates fake event data emitted to Kafka. Spark consumes and processes Kafka data, saving it to the Datalake. Airflow orchestrates the pipeline. dbt moves data to Snowflake, transforms it, and creates dashboards.
Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
viberary
Good books, good vibes