Mikhail Grankin's repositories
fast_tabnet
TabNet for fastai
deepmind-research
This repository contains implementations and illustrative code to accompany DeepMind publications
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
entropix
Entropy Based Sampling and Parallel CoT Decoding
FoodSeg103-Benchmark-v1
MM'21 Main-Track paper
google-research
Google Research
mgpt
Multilingual Generative Pretrained Model
minGPT-quantize
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
pytorch-vq-vae
PyTorch implementation of VQ-VAE by Aäron van den Oord et al.
RETRO-pytorch
Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch
tab-transformer-pytorch
Implementation of TabTransformer, attention network for tabular data, in Pytorch
vector-quantize-pytorch
Vector Quantization, in Pytorch
yarn
YaRN: Efficient Context Window Extension of Large Language Models