Xiaoyang Yu (Lamperougeyxy)

Lamperougeyxy

Geek Repo

Company:Beijing Jiaotong University

Github PK Tool:Github PK Tool

Xiaoyang Yu's starred repositories

annotated_deep_learning_paper_implementations

๐Ÿง‘โ€๐Ÿซ 60 Implementations/tutorials of deep learning papers with side-by-side notes ๐Ÿ“; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), ๐ŸŽฎ reinforcement learning (ppo, dqn), capsnet, distillation, ... ๐Ÿง 

Language:PythonLicense:MITStargazers:51977Issues:435Issues:130

gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners"

Language:PythonLicense:NOASSERTIONStargazers:21997Issues:637Issues:262

FinRL

FinRL: Financial Reinforcement Learning. ๐Ÿ”ฅ

Language:Jupyter NotebookLicense:MITStargazers:9444Issues:199Issues:707

trl

Train transformer language models with reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:8808Issues:77Issues:1005

ccf-deadlines

โฐ Collaboratively track deadlines of conferences recommended by CCF (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~

Language:VueLicense:MITStargazers:5423Issues:22Issues:72

Voyager

An Open-Ended Embodied Agent with Large Language Models

Language:JavaScriptLicense:MITStargazers:5384Issues:62Issues:142
Language:PythonLicense:Apache-2.0Stargazers:3577Issues:83Issues:133

visualboyadvance-m

The continuing development of the legendary VBA gameboy advance emulator.

torchscale

Foundation Architecture for (M)LLMs

Language:PythonLicense:MITStargazers:2975Issues:46Issues:76

Autoformer

About Code release for "Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting" (NeurIPS 2021), https://arxiv.org/abs/2106.13008

Language:Jupyter NotebookLicense:MITStargazers:1830Issues:15Issues:202

on-policy

This is the official implementation of Multi-Agent PPO (MAPPO).

Language:PythonLicense:MITStargazers:1208Issues:7Issues:89

CityFlow

A Multi-Agent Reinforcement Learning Environment for Large Scale City Traffic Scenario

Language:C++License:Apache-2.0Stargazers:767Issues:19Issues:131

sumo-rl

Reinforcement Learning environments for Traffic Signal Control with SUMO. Compatible with Gymnasium, PettingZoo, and popular RL libraries.

Language:PythonLicense:MITStargazers:664Issues:11Issues:169

GITM

Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory

HARL

Official implementation of HARL algorithms based on PyTorch.

Crossformer

Official implementation of our ICLR 2023 paper "Crossformer: Transformer Utilizing Cross-Dimension Dependency for Multivariate Time Series Forecasting"

Language:PythonLicense:Apache-2.0Stargazers:404Issues:3Issues:24
Language:PythonLicense:MITStargazers:184Issues:5Issues:30

RESCO

Reinforcement Learning Benchmarks for Traffic Signal Control (RESCO)

Multi-Agent-Distributed-PPO-Traffc-light-control

multi agent RL for traffic light control in Sumo using distributed PPO

Language:PythonLicense:MITStargazers:87Issues:4Issues:5

CDS

[NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.

Language:PythonLicense:Apache-2.0Stargazers:81Issues:1Issues:11

multi-agent-PPO-on-SMAC

Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.

unmas

the source code of UNMAS: Multi-Agent Reinforcement Learning for Unshaped Cooperative Scenarios

Language:PythonLicense:Apache-2.0Stargazers:44Issues:8Issues:1

VDACs

Value-Decomposition Multi-Agent Actor-Critics

Language:PythonLicense:MITStargazers:39Issues:1Issues:5

pymarl_transformers

Official repository of the paper TransfQMix: Transformers for Leveraging the Graph Structure of Multi-Agent Reinforcement Learning Problems (AAMAS 2023)

Language:PythonLicense:Apache-2.0Stargazers:29Issues:2Issues:1

smac_exp

An open source benchmark for Multi Agent Reinforcement Learning

Language:PythonStargazers:29Issues:2Issues:0

A2PO-ICLR2023

Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)

Language:PythonLicense:MITStargazers:25Issues:1Issues:2