Beast code in Giters

RuanJingqing's repositories

GCS_aamas337

The code for AAMAS2022 《GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning》

Language:Python39 2 3

CARE-SMAC-MA_SAC

Multi-task Multi-agent Soft Actor Critic for SMAC

Language:Python12 10

EFA-DWM

Language:Python5 1 1

Papers-of-MARL

3 10

Conventions-ModularPolicy

PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021

Language:Python200

Deep-Reinforcement-Learning-Algorithms

This is a reconstruction of previous repository(rl-algorithms).

Language:Python100

Papers-of-Offline-RL

Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)

100

AI-Paper-Collector

Fully-automated scripts for collecting AI-related papers

Language:Python000

Algorithm_Interview_Notes-Chinese-backups

Language:Python000

attention-learn-to-route

Attention based model for learning to solve different routing problems

Language:Jupyter NotebookMIT000

Ball-Run

Language:Python010

BladeDancer957

000

Bullet-Safety-Gym

An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.

Language:PythonMIT000

CORRO

CORRO code

Language:Python000

DGN

DGN Code

Language:Python000

DuaLight

Language:Python000

Flowcomm

Language:Python000

football

Check out the new game server:

Language:PythonApache-2.0000

GCS

The implementation of GCS

Language:Python000

LLaVA-RLHF

Aligning LMMs with Factually Augmented RLHF

Language:PythonGPL-3.0000

minimalRL

Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

Language:PythonMIT000

multi-agent-PPO-on-SMAC

Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.

Language:Python000

NLPer-Interview

该仓库主要记录 NLP 算法工程师相关的面试题

000

on-policy

Language:PythonMIT000

open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

Language:C++Apache-2.0000

Paper_Writing_Tips

000

releasing-research-code

Tips for releasing research code in Machine Learning (with official NeurIPS 2020 recommendations)

MIT000

SEQ-SCD

Language:Python000

shap

A game theoretic approach to explain the output of any machine learning model.

Language:Jupyter NotebookMIT000

WeTS

A benchmark for the task of translation suggestion

Language:MaskUnlicense000