Sainbayar Sukhbaatar's starred repositories
pytorch-lightning
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
deep-photo-styletransfer
Code and data for paper "Deep Photo Style Transfer": https://arxiv.org/abs/1703.07511
deep-learning-models
Keras code and weights files for popular deep learning models.
pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
reinforcement-learning
Minimal and Clean Reinforcement Learning Examples
noreward-rl
[ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning
debugger.lua
A dependency free, embeddable debugger for Lua in a single file (.lua or .h)
modular_rl
Implementation of TRPO and related algorithms
adaptive-span
Transformer training code for sequential tasks
learning-to-communicate
Learning to Communicate with Deep Multi-Agent Reinforcement Learning
pytorch-trpo
PyTorch implementation of Trust Region Policy Optimization
unlikelihood_training
Neural Text Generation with Unlikelihood Training
transformer-sequential
Trains Transformer model variants. Data isn't shuffled between batches.
gym-starcraft
StarCraft: BroodWars OpenAI Gym environment
Multiple-smi
Python bindings for pyNVML and psutil library over network