vin136 / MLOPS

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

MLOPS

project-ideas

Distribution-Shift

Uncertanity Estimation

Resources:

FullStack DL

Made With ML

Chip-huyen's book

ML-SYSTEM DESIGN Go through the slides and read the attached research papers.

ML-INTERVIEW'S BOOK

Some links from this reddit-thread

Interview-prep

DataScience

Writing Good Research Code

GRC

reprdl

Project

A Recommender system using the ideas from - NCF

Tools

  1. Hydra: Here's a good tutorial
  2. Pytorch-lightning(For a well structured code-base)
  3. Wand(for hyperparmeter tuning and experment tracking)
  4. Githooks,testing,styling.(Good practices)

Extra: Not really necessary for every research project.

Data-Versioning: DVC

Containerizaiton: Docker

Summary

For mac M1 users to install sklearn.

/bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"

Now change the path, use your user name

echo 'eval $(/opt/homebrew/bin/brew shellenv)' >> /Users/"YOUR USER NAME"/.zprofile
eval $(/opt/homebrew/bin/brew shellenv)
brew install openblas
export OPENBLAS=$(/opt/homebrew/bin/brew --prefix openblas)
export CFLAGS="-falign-functions=8 ${CFLAGS}"
# ^ no need to add to .zshrc, just doing this once.
pip install scikit-learn # ==0.24.1 if you want

Flags

Multirun flag

eg:python main.py lr=1e-3,1e-2 wd=1e-4,1e-2 -m

Print config

eg:python main.py --cfg job

Modify

eg:python train.py data.batch_size=4

Create a flag and set it.

eg:python train.py +trainer.fast_dev_run=True

Remove a flag

eg:python train.py ~trainer.gpus

  1. Pytorch-lightning+Hydra for configuration.

About


Languages

Language:Jupyter Notebook 82.3%Language:Python 17.7%