Beast code in Giters

CDM1619's repositories

alpha-zero-general

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

Language:Jupyter NotebookMIT000

AlphaHydrogen is an open source OpenAI Gym environment that simulates the energy system of a residential community with distributed renewable power supply, fuel-cell vehicles, hydrogen stations, and power grid.

Language:Jupyter NotebookMIT000

CloseAirCombat

An environment based on JSBSIM aimed at one-to-one close air combat.

Language:Python000

Diff4RLSurvey

This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Reinforcement Learning: A Survey"

Apache-2.0000

DIPO

Language:PythonMIT000

diverse_psro

Language:Python000

DMPO-vanilla

Language:Python000

generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

Apache-2.0000

gym-marl-reconnaissance

Gym environment for cooperative multi-agent reinforcement learning in heterogeneous robot teams

Language:PythonMIT000

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Apache-2.0000

Machine-Learning

000

Malib

A parallel framework for population-based multi-agent reinforcement learning.

MIT000

MAPDN

This repository is for an open-source environment for multi-agent active voltage control on power distribution networks (MAPDN).

MIT000

NAC

NeurIPS-2021: Neural Auto-Curricula in Two-Player Zero-Sum Games.

000

nash-dqn

Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games. Zihan Ding, Dijia Su, Qinghua Liu, Chi Jin

000

NXDO

Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games

MIT000

Online-dt

Online Decision Transformer

NOASSERTION000

Open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

Apache-2.0000

PGSIM

PGSIM Simulator

000

Pipeline-PSRO

Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games

MIT000

poker-bot

An OpenAI gym environment and RL agent for Texas hold 'em Poker

000

PowerGridworld

PowerGridworld provides users with a lightweight, modular, and customizable framework for creating power-systems-focused, multi-agent Gym environments that readily integrate with existing training frameworks for reinforcement learning (RL). https://arxiv.org/abs/2111.05969

BSD-3-Clause000

CDM1619

CDM1619's repositories

alpha-zero-general

AlphaHydrogen

CloseAirCombat

Diff4RLSurvey

DIPO

diverse_psro

DMPO-vanilla

generative_agents

gym-marl-reconnaissance

lmdeploy

Machine-Learning

Malib

MAPDN

NAC

nash-dqn

NXDO

Online-dt

Open_spiel

PGSIM

Pipeline-PSRO

poker-bot

PowerGridworld

PSRO_BD_RD

RL-MPC

SciencePlots

Stratego_Env