Mikhail Grankin's repositories

Language: Python | License: Apache-2.0 | Stargazers: 775 | Issues: 31 | Issues: 44

over9000

Over9000 optimizer

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 425 | Issues: 20 | Issues: 19

fast_tabnet

TabNet for fastai

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 123 | Issues: 7 | Issues: 10

minGPT

minGPT in JAX

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 46 | Issues: 2 | Issues: 0
Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 10 | Issues: 3 | Issues: 0

ru-gpts

Russian GPT2 models.

Language: Python | License: Apache-2.0 | Stargazers: 5 | Issues: 3 | Issues: 0

gpt-c

Train GPT model helped by CLIP

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2 | Issues: 3 | Issues: 0
Language: Python | Stargazers: 2 | Issues: 3 | Issues: 0

CLIP_JAX

Contrastive Language-Image Pretraining

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 2 | Issues: 0

deepmind-research

This repository contains implementations and illustrative code to accompany DeepMind publications

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

dSRVAE

Unsupervised Real Image Super-Resolution via Variational AutoEncoder

Language: Python | Stargazers: 0 | Issues: 2 | Issues: 0

entropix

Entropy Based Sampling and Parallel CoT Decoding

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

FoodSeg103-Benchmark-v1

MM'21 Main-Track paper

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

google-research

Google Research

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners"

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

mgpt

Multilingual Generative Pretrained Model

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

minGPT-quantize

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0
Language: TypeScript | Stargazers: 0 | Issues: 1 | Issues: 0

pytorch-vq-vae

PyTorch implementation of VQ-VAE by Aäron van den Oord et al.

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

RETRO-pytorch

Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Stargazers: 0 | Issues: 0 | Issues: 0

stihbot

Telegram bot for Russian GPT models

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 2 | Issues: 0

tab-transformer-pytorch

Implementation of TabTransformer, attention network for tabular data, in Pytorch

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

vector-quantize-pytorch

Vector Quantization, in Pytorch

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

yarn

YaRN: Efficient Context Window Extension of Large Language Models

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0