Xiaoyang Yu's starred repositories
decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
transformer
A Pytorch Implementation of "Attention is All You Need" and "Weighted Transformer Network for Machine Translation"
Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
sentence-transformers
State-of-the-Art Text Embeddings
mbpo_pytorch
A pytorch reprelication of the model-based reinforcement learning algorithm MBPO
NAF-tensorflow
"Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow
Popular-RL-Algorithms
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
mpc.pytorch
A fast and differentiable model predictive control (MPC) solver for PyTorch.
pytorch-feudal-network
Pytorch implementation of [Feudal Net](https://arxiv.org/abs/1703.01161). ([Tensorflow version](https://github.com/dmakian/feudal_networks))
Data-Efficient-Reinforcement-Learning-with-Probabilistic-Model-Predictive-Control
Unofficial Implementation of the paper "Data-Efficient Reinforcement Learning with Probabilistic Model Predictive Control", applied to gym environments
RODE
Codes accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is a scalable role-based multi-agent learning method which effectively discovers roles based on joint action space decomposition according to action effects, establishing a new state of the art on the StarCraft multi-agent benchmark.
go-explore
Code for Go-Explore: a New Approach for Hard-Exploration Problems
PyTorch-VAE
A Collection of Variational Autoencoders (VAE) in PyTorch.
handful-of-trials
Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
pytorch-A3C
Simple A3C implementation with pytorch + multiprocessing
Evolutionary-Algorithm
Evolutionary Algorithm using Python, 莫烦Python 中文AI教学
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.