RuanJingqing's starred repositories

NLPer-Interview

该仓库主要记录 NLP 算法工程师相关的面试题

diffuser

Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"

Language:PythonLicense:MITStargazers:810Issues:12Issues:61

pytorch-soft-actor-critic

PyTorch implementation of soft actor critic

Language:PythonLicense:MITStargazers:795Issues:9Issues:37

pymarl2

Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)

Language:PythonLicense:Apache-2.0Stargazers:599Issues:16Issues:40

pytorch-DRL

PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.

Language:PythonLicense:MITStargazers:524Issues:12Issues:7

trajectory-transformer

Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"

Language:PythonLicense:MITStargazers:451Issues:6Issues:20

MARL-code-pytorch

Concise pytorch implements of MARL algorithms, including MAPPO, MADDPG, MATD3, QMIX and VDN.

Language:PythonLicense:MITStargazers:411Issues:2Issues:23

ACER

Actor-critic with experience replay

Language:PythonLicense:MITStargazers:251Issues:13Issues:13

DRL

Repository for codes of 'Deep Reinforcement Learning'

Language:PythonStargazers:214Issues:10Issues:0
Language:PythonLicense:MITStargazers:193Issues:5Issues:32

CDS

[NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.

Language:PythonLicense:Apache-2.0Stargazers:82Issues:1Issues:11

mPLUG

mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. (EMNLP 2022)

REFIL

Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021

Language:PythonLicense:MITStargazers:61Issues:2Issues:3

DualGATs

Code for ACL2023 paper 《DualGATs: Dual Graph Attention Networks for Emotion Recognition in Conversations》

ToM2C

The offcial implementation of "ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind" (ICLR 2022) .

Language:PythonStargazers:42Issues:3Issues:0

TSAM

The code for COLING2022 paper: 《TSAM: A Two-Stream Attention Model for Causal Emotion Entailment》

Language:PythonStargazers:32Issues:1Issues:0

action-hypergraph-networks

(ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices

Language:PythonLicense:MITStargazers:21Issues:1Issues:1
Language:PythonStargazers:10Issues:0Issues:0

Qatten_Multiagent_RL

Implement of Qatten on SMAC (updating)

Language:PythonStargazers:6Issues:1Issues:0

contextual-policy-reuse-deep-rl

Framework for Contextually Transferring Knowledge from Multiple Source Policies in Deep Reinforcement Learning

License:MITStargazers:3Issues:2Issues:0

FISS

[CVPR2023] Federated Incremental Semantic Segmentation

Language:PythonStargazers:2Issues:0Issues:0

controlvideo

Official implementation for "ControlVideo: Adding Conditional Control for One Shot Text-to-Video Editing"

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

WeTS

A benchmark for the task of translation suggestion

Language:MaskLicense:UnlicenseStargazers:1Issues:0Issues:0