Beast code in Giters

Xiaoyang Yu's starred repositories

SAAC-StarCraft-Adversary-Agent-Challenge

Language:Python900

decision-transformer

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

Language:PythonMIT234100

attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Language:PythonMIT877700

transformer

A Pytorch Implementation of "Attention is All You Need" and "Weighted Transformer Network for Machine Translation"

Language:Python54600

Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Language:PythonMIT1366300

sentence-transformers

State-of-the-Art Text Embeddings

Language:PythonApache-2.01494000

pymarl2

Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)

Language:PythonApache-2.060400

epymarl

An extension of the PyMARL codebase that includes additional algorithms and environment support

Language:PythonApache-2.047500

pymarl-algorithm-extension-via-starcraft

Language:PythonApache-2.01200

mbpo_pytorch

A pytorch reprelication of the model-based reinforcement learning algorithm MBPO

Language:Python15100

mve

MVE: model-based value estimation

Language:PythonApache-2.01000

NAF-tensorflow

"Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow

Language:PythonMIT19300

BIRD_code

Code for paper "Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning".

Language:Python1400

Popular-RL-Algorithms

PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..

Language:Jupyter NotebookApache-2.0108700

mpc.pytorch

A fast and differentiable model predictive control (MPC) solver for PyTorch.

Language:PythonMIT86700

do-mpc

Model predictive control python toolbox

Language:PythonLGPL-3.096000

pytorch-feudal-network

Pytorch implementation of [Feudal Net](https://arxiv.org/abs/1703.01161). ([Tensorflow version](https://github.com/dmakian/feudal_networks))

Language:Python1600

PILCO

Bayesian Reinforcement Learning in Tensorflow

Language:PythonMIT31300

Data-Efficient-Reinforcement-Learning-with-Probabilistic-Model-Predictive-Control

Unofficial Implementation of the paper "Data-Efficient Reinforcement Learning with Probabilistic Model Predictive Control", applied to gym environments

Language:PythonMIT12700

Codes accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is a scalable role-based multi-agent learning method which effectively discovers roles based on joint action space decomposition according to action effects, establishing a new state of the art on the StarCraft multi-agent benchmark.

Language:PythonApache-2.06700

Lamperougeyxy

Xiaoyang Yu's starred repositories

SAAC-StarCraft-Adversary-Agent-Challenge

decision-transformer

attention-is-all-you-need-pytorch

transformer

Swin-Transformer

sentence-transformers

pymarl2

epymarl

pymarl-algorithm-extension-via-starcraft

mbpo_pytorch

mve

NAF-tensorflow

nn_dynamics

BIRD_code

Popular-RL-Algorithms

mpc.pytorch

do-mpc

pytorch-feudal-network

PILCO

Data-Efficient-Reinforcement-Learning-with-Probabilistic-Model-Predictive-Control

RODE

gps

go-explore

examples

PyTorch-VAE

handful-of-trials

mbpo

pytorch-A3C

Evolutionary-Algorithm

transformers