Beast code in Giters

Xiaoyang Yu's starred repositories

annotated_deep_learning_paper_implementations

🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language:PythonMIT51977 435 130

gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners"

Language:PythonNOASSERTION21997 637 262

FinRL

FinRL: Financial Reinforcement Learning. 🔥

Language:Jupyter NotebookMIT9444 199 707

trl

Train transformer language models with reinforcement learning.

Language:PythonApache-2.08808 77 1005

ccf-deadlines

⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~

Language:VueMIT5423 22 72

Voyager

An Open-Ended Embodied Agent with Large Language Models

Language:JavaScriptMIT5384 62 142

transformer-xl

Language:PythonApache-2.03577 83 133

visualboyadvance-m

The continuing development of the legendary VBA gameboy advance emulator.

Language:C++3272 109 847

torchscale

Foundation Architecture for (M)LLMs

Language:PythonMIT2975 46 76

Autoformer

About Code release for "Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting" (NeurIPS 2021), https://arxiv.org/abs/2106.13008

Language:Jupyter NotebookMIT1830 15 202

on-policy

This is the official implementation of Multi-Agent PPO (MAPPO).

Language:PythonMIT1208 7 89

CityFlow

A Multi-Agent Reinforcement Learning Environment for Large Scale City Traffic Scenario

Language:C++Apache-2.0767 19 131

sumo-rl

Reinforcement Learning environments for Traffic Signal Control with SUMO. Compatible with Gymnasium, PettingZoo, and popular RL libraries.

Language:PythonMIT664 11 169

GITM

Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory

583 22 13

HARL

Official implementation of HARL algorithms based on PyTorch.

Language:Python418 8 41

Crossformer

Official implementation of our ICLR 2023 paper "Crossformer: Transformer Utilizing Cross-Dimension Dependency for Multivariate Time Series Forecasting"

Language:PythonApache-2.0404 3 24

Multi-Agent-Transformer

Language:Python305 9 37

smacv2

Language:PythonMIT184 5 30

RESCO

Reinforcement Learning Benchmarks for Traffic Signal Control (RESCO)

Language:Python107 5 22

Multi-Agent-Distributed-PPO-Traffc-light-control

multi agent RL for traffic light control in Sumo using distributed PPO

Language:PythonMIT87 4 5

CDS

[NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.

Language:PythonApache-2.081 1 11

multi-agent-PPO-on-SMAC

Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.

Language:Python52 2 2

Transformer-in-Reinforcement-Learning-for-Decision-Making-A-Survey

Transformer in RL for decision-making

50 10

unmas

the source code of UNMAS: Multi-Agent Reinforcement Learning for Unshaped Cooperative Scenarios

Language:PythonApache-2.044 8 1

VDACs

Value-Decomposition Multi-Agent Actor-Critics

Language:PythonMIT39 1 5

pymarl_transformers

Official repository of the paper TransfQMix: Transformers for Leveraging the Graph Structure of Multi-Agent Reinforcement Learning Problems (AAMAS 2023)

Language:Python31 2 1

ASN

Language:PythonApache-2.029 2 1

smac_exp

An open source benchmark for Multi Agent Reinforcement Learning

Language:Python29 20

A2PO-ICLR2023

Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)

Language:PythonMIT25 1 2

FOP-DMAC-MACPF

Language:Python1003