Mihály Hazag's starred repositories
youtube-dl
Command-line program to download videos from YouTube.com and other video sites
ML-From-Scratch
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.
azure-quickstart-templates
Azure Quickstart Templates
terraform-provider-azurerm
Terraform provider for Azure Resource Manager
aws-eks-best-practices
A best practices guide for day 2 operations, including operational excellence, security, reliability, performance efficiency, and cost optimization.
large-language-models
Notebooks for Large Language Models (LLMs) Specialization
awesome-public-real-time-datasets
A list of publicly available datasets with real-time data maintained by the team at bytewax.io
data-science-your-way
Ways of doing Data Science Engineering and Machine Learning in R and Python
complete-dbt-bootcamp-zero-to-hero
Supplementary Materials for the The Complete dbt (Data Build Tool) Bootcamp Udemy course
AzureDatabricksBestPractices
Version 1 of Technical Best Practices of Azure Databricks based on real world Customer and Technical SME inputs
eks-workshop-v2
Hands-on labs for Amazon EKS
mlflow-example
An example MLflow project
mlflow-workshop-part-1
Partly lecture and partly a hands-on tutorial and workshop, this is a three part series on how to get started with MLflow. In this three part series, we will cover MLflow Tracking, Projects, Models, and Model Registry.
mlp-regression-template
Example repo to kickstart integration with mlflow pipelines.
Local-Data-LakeHouse
Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testing.
dlt-files-in-repos-demo
Demonstration of using Files in Repos with Databricks Delta Live Tables
spark-ml-intro
Spark.ML introduction in Python and SparkR