Mihály Hazag's starred repositories

youtube-dl

Command-line program to download videos from YouTube.com and other video sites

Language:PythonLicense:UnlicenseStargazers:132410Issues:2193Issues:26630

langchain

🦜🔗 Build context-aware reasoning applications

Language:Jupyter NotebookLicense:MITStargazers:94882Issues:691Issues:7867

ML-From-Scratch

Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.

Language:PythonLicense:MITStargazers:23993Issues:939Issues:70

mlflow

Open source platform for the machine learning lifecycle

Language:PythonLicense:Apache-2.0Stargazers:18770Issues:303Issues:3939

azure-quickstart-templates

Azure Quickstart Templates

Language:BicepLicense:MITStargazers:14070Issues:736Issues:1538

datahub

The Metadata Platform for your Data and AI Stack

Language:JavaLicense:Apache-2.0Stargazers:9910Issues:251Issues:2221

s3fs-fuse

FUSE-based file system backed by Amazon S3

Language:C++License:GPL-2.0Stargazers:8670Issues:174Issues:1244

materials

Bonus materials, exercises, and example projects for our Python tutorials

Language:HTMLLicense:MITStargazers:4805Issues:157Issues:59

terraform-provider-azurerm

Terraform provider for Azure Resource Manager

Language:GoLicense:MPL-2.0Stargazers:4604Issues:239Issues:14254

aws-eks-best-practices

A best practices guide for day 2 operations, including operational excellence, security, reliability, performance efficiency, and cost optimization.

Language:PythonLicense:NOASSERTIONStargazers:2043Issues:87Issues:186

large-language-models

Notebooks for Large Language Models (LLMs) Specialization

Language:PythonLicense:NOASSERTIONStargazers:766Issues:25Issues:9

awesome-public-real-time-datasets

A list of publicly available datasets with real-time data maintained by the team at bytewax.io

License:CC0-1.0Stargazers:611Issues:8Issues:0

data-science-your-way

Ways of doing Data Science Engineering and Machine Learning in R and Python

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:594Issues:52Issues:1

spark-xml

XML data source for Spark SQL and DataFrames

Language:ScalaLicense:Apache-2.0Stargazers:505Issues:39Issues:426

complete-dbt-bootcamp-zero-to-hero

Supplementary Materials for the The Complete dbt (Data Build Tool) Bootcamp Udemy course

Language:ShellLicense:NOASSERTIONStargazers:455Issues:8Issues:0

AzureDatabricksBestPractices

Version 1 of Technical Best Practices of Azure Databricks based on real world Customer and Technical SME inputs

eks-workshop-v2

Hands-on labs for Amazon EKS

Language:HCLLicense:Apache-2.0Stargazers:441Issues:29Issues:251

mlflow-example

An example MLflow project

Language:PythonLicense:Apache-2.0Stargazers:237Issues:22Issues:9

mlflow-workshop-part-1

Partly lecture and partly a hands-on tutorial and workshop, this is a three part series on how to get started with MLflow. In this three part series, we will cover MLflow Tracking, Projects, Models, and Model Registry.

License:Apache-2.0Stargazers:234Issues:10Issues:0

amqpstorm

Thread-safe Python RabbitMQ Client & Management library

Language:PythonLicense:MITStargazers:188Issues:20Issues:72

mlp-regression-template

Example repo to kickstart integration with mlflow pipelines.

Language:PythonLicense:Apache-2.0Stargazers:73Issues:9Issues:11

Local-Data-LakeHouse

Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testing.

dlt-files-in-repos-demo

Demonstration of using Files in Repos with Databricks Delta Live Tables

Language:HCLLicense:MITStargazers:29Issues:4Issues:1

spark-ml-intro

Spark.ML introduction in Python and SparkR

Language:Jupyter NotebookStargazers:8Issues:5Issues:0