Carlos Leyson's starred repositories

computer-science

🎓 Path to a free self-taught education in Computer Science!

awesome-public-datasets

A topic-centric list of HQ open datasets.

dive

A tool for exploring each layer in a docker image

awesome-computer-vision

A curated list of awesome computer vision resources

DeepLearningExamples

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

Language: Jupyter Notebook | Stargazers: 12793 | Issues: 296 | Issues: 820

bocker

Docker implemented in around 100 lines of bash

Language: Shell | License: GPL-3.0 | Stargazers: 11158 | Issues: 273 | Issues: 15

nn-zero-to-hero

Neural Networks: Zero to Hero

Language: Jupyter Notebook | License: MIT | Stargazers: 10659 | Issues: 274 | Issues: 27

machine-learning-interview

Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.

BERTopic

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

Language: Python | License: MIT | Stargazers: 5684 | Issues: 52 | Issues: 1625

notebooks

Jupyter notebooks for the Natural Language Processing with Transformers book

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 3642 | Issues: 60 | Issues: 92

pandera

A light-weight, flexible, and expressive statistical data testing library

Language: Python | License: MIT | Stargazers: 3084 | Issues: 18 | Issues: 798

mlops-course

Learn how to design, develop, deploy and iterate on production-grade ML applications.

Language: Jupyter Notebook | License: MIT | Stargazers: 2785 | Issues: 54 | Issues: 19

nba_api

An API Client package to access the APIs for NBA.com

Language: Python | License: MIT | Stargazers: 2324 | Issues: 89 | Issues: 299

course22

The fast.ai course notebooks

Language: Jupyter Notebook | Stargazers: 2227 | Issues: 48 | Issues: 71

dmls-book

Summaries and resources for Designing Machine Learning Systems book (Chip Huyen, O'Reilly 2022)

MLE-Flashcards

200+ detailed flashcards useful for reviewing topics in machine learning, computer vision, and computer science.

DataProfiler

What's in your data? Extract schema, statistics and entities from datasets

Language: Python | License: Apache-2.0 | Stargazers: 1372 | Issues: 21 | Issues: 175

dstack

An open-source container orchestration engine for running AI workloads in any cloud or data center. https://discord.gg/u8SmfwPpMd

Language: Python | License: MPL-2.0 | Stargazers: 1145 | Issues: 10 | Issues: 706

python-versioneer

version-string management for VCS-controlled trees

Language: Python | License: Unlicense | Stargazers: 1063 | Issues: 19 | Issues: 212

hamilton

A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton

Language: Python | License: BSD-3-Clause-Clear | Stargazers: 865 | Issues: 20 | Issues: 106

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 461 | Issues: 22 | Issues: 12

awesome-online-machine-learning

📑 Online machine learning resources

distributed-ml-patterns

Distributed Machine Learning Patterns from Manning Publications by Yuan Tang https://bit.ly/2RKv8Zo

Language: Python | License: Apache-2.0 | Stargazers: 346 | Issues: 13 | Issues: 5

fsdl-text-recognizer-2022-labs

Complete deep learning project developed in Full Stack Deep Learning, 2022 edition. Generated automatically from https://github.com/full-stack-deep-learning/fsdl-text-recognizer-2022

Language: Jupyter Notebook | License: MIT | Stargazers: 280 | Issues: 7 | Issues: 1

recs-at-resonable-scale

Recommendations at "Reasonable Scale": joining dataOps with recSys through dbt, Merlin and Metaflow

Language: Python | License: MIT | Stargazers: 223 | Issues: 9 | Issues: 2

foundation-models-for-dbt-entity-matching

Playground for using large language models in the Modern Data Stack for entity matching

Language: Python | License: MIT | Stargazers: 104 | Issues: 3 | Issues: 0

dsbook

Code samples for the Effective Data Science Infrastructure book

Language: Python | License: Apache-2.0 | Stargazers: 96 | Issues: 6 | Issues: 2

feast-workshop

A workshop with several modules to help learn Feast, an open-source feature store

Language: Jupyter Notebook | Stargazers: 73 | Issues: 10 | Issues: 1

taxi-demo-rp-mz-rv-rd-st

🚕 Self-contained demo using Redpanda, Materialize, River, Redis, and Streamlit to predict taxi trip durations

Language: Python | License: MIT | Stargazers: 45 | Issues: 4 | Issues: 0

de4ml

Supporting materials/code examples for my course in data engineering for machine learning.

Language: Python | Stargazers: 38 | Issues: 7 | Issues: 0