Xiaoyang Yu (Lamperougeyxy)



Company: Beijing Jiaotong University


Xiaoyang Yu's starred repositories

decision-transformer

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

Language: Python | License: MIT | Stargazers: 2341 | Issues: 0

attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Language: Python | License: MIT | Stargazers: 8777 | Issues: 0

transformer

A Pytorch Implementation of "Attention is All You Need" and "Weighted Transformer Network for Machine Translation"

Language: Python | Stargazers: 546 | Issues: 0

Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Language: Python | License: MIT | Stargazers: 13663 | Issues: 0

sentence-transformers

State-of-the-Art Text Embeddings

Language: Python | License: Apache-2.0 | Stargazers: 14940 | Issues: 0

pymarl2

Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)

Language: Python | License: Apache-2.0 | Stargazers: 604 | Issues: 0

epymarl

An extension of the PyMARL codebase that includes additional algorithms and environment support

Language: Python | License: Apache-2.0 | Stargazers: 475 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 12 | Issues: 0

mbpo_pytorch

A PyTorch replication of the model-based reinforcement learning algorithm MBPO

Language: Python | Stargazers: 151 | Issues: 0

mve

MVE: model-based value estimation

Language: Python | License: Apache-2.0 | Stargazers: 10 | Issues: 0

NAF-tensorflow

"Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow

Language: Python | License: MIT | Stargazers: 193 | Issues: 0
Language: Jupyter Notebook | Stargazers: 332 | Issues: 0

BIRD_code

Code for paper "Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning".

Language: Python | Stargazers: 14 | Issues: 0

Popular-RL-Algorithms

PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1087 | Issues: 0

mpc.pytorch

A fast and differentiable model predictive control (MPC) solver for PyTorch.

Language: Python | License: MIT | Stargazers: 867 | Issues: 0

do-mpc

Model predictive control python toolbox

Language: Python | License: LGPL-3.0 | Stargazers: 960 | Issues: 0

pytorch-feudal-network

Pytorch implementation of [Feudal Net](https://arxiv.org/abs/1703.01161). ([Tensorflow version](https://github.com/dmakian/feudal_networks))

Language: Python | Stargazers: 16 | Issues: 0

PILCO

Bayesian Reinforcement Learning in Tensorflow

Language: Python | License: MIT | Stargazers: 313 | Issues: 0

Data-Efficient-Reinforcement-Learning-with-Probabilistic-Model-Predictive-Control

Unofficial Implementation of the paper "Data-Efficient Reinforcement Learning with Probabilistic Model Predictive Control", applied to gym environments

Language: Python | License: MIT | Stargazers: 127 | Issues: 0

RODE

Code accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks" (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is a scalable role-based multi-agent learning method that discovers roles by decomposing the joint action space according to action effects, establishing a new state of the art on the StarCraft multi-agent benchmark.

Language: Python | License: Apache-2.0 | Stargazers: 67 | Issues: 0

gps

Guided Policy Search

Language: Python | License: NOASSERTION | Stargazers: 597 | Issues: 0

go-explore

Code for Go-Explore: a New Approach for Hard-Exploration Problems

Language: Python | License: NOASSERTION | Stargazers: 556 | Issues: 0

examples

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

Language: Python | License: BSD-3-Clause | Stargazers: 22268 | Issues: 0

PyTorch-VAE

A Collection of Variational Autoencoders (VAE) in PyTorch.

Language: Python | License: Apache-2.0 | Stargazers: 6503 | Issues: 0

handful-of-trials

Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"

Language: Python | License: MIT | Stargazers: 424 | Issues: 0

mbpo

Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"

Language: Python | License: MIT | Stargazers: 476 | Issues: 0

pytorch-A3C

Simple A3C implementation with pytorch + multiprocessing

Language: Python | License: MIT | Stargazers: 612 | Issues: 0

Evolutionary-Algorithm

Evolutionary Algorithm using Python, from the 莫烦Python (Mofan Python) Chinese-language AI tutorials

Language: Python | License: MIT | Stargazers: 1192 | Issues: 0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language: Python | License: Apache-2.0 | Stargazers: 132696 | Issues: 0