RuanJingqing's repositories

GCS_aamas337

The code for AAMAS2022 《GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning》

CARE-SMAC-MA_SAC

Multi-task Multi-agent Soft Actor Critic for SMAC

Language:PythonStargazers:12Issues:1Issues:0

Conventions-ModularPolicy

PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021

Language:PythonStargazers:2Issues:0Issues:0

Papers-of-Offline-RL

Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)

Stargazers:1Issues:0Issues:0

AI-Paper-Collector

Fully-automated scripts for collecting AI-related papers

Language:PythonStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

attention-learn-to-route

Attention based model for learning to solve different routing problems

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0
Stargazers:0Issues:0Issues:0

Bullet-Safety-Gym

An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

CORRO

CORRO code

Language:PythonStargazers:0Issues:0Issues:0

Deep-Reinforcement-Learning-Algorithms

This is a reconstruction of previous repository(rl-algorithms).

Language:PythonStargazers:0Issues:0Issues:0

DGN

DGN Code

Language:PythonStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

football

Check out the new game server:

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

GCS

The implementation of GCS

Stargazers:0Issues:0Issues:0

LLaVA-RLHF

Aligning LMMs with Factually Augmented RLHF

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

minimalRL

Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

multi-agent-PPO-on-SMAC

Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.

Language:PythonStargazers:0Issues:0Issues:0

NLPer-Interview

该仓库主要记录 NLP 算法工程师相关的面试题

Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

releasing-research-code

Tips for releasing research code in Machine Learning (with official NeurIPS 2020 recommendations)

License:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

shap

A game theoretic approach to explain the output of any machine learning model.

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

WeTS

A benchmark for the task of translation suggestion

Language:MaskLicense:UnlicenseStargazers:0Issues:0Issues:0