CDM1619

Location: China

CDM1619's repositories

alpha-zero-general

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0

AlphaHydrogen

AlphaHydrogen is an open source OpenAI Gym environment that simulates the energy system of a residential community with distributed renewable power supply, fuel-cell vehicles, hydrogen stations, and power grid.

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0

CloseAirCombat

An environment based on JSBSIM aimed at one-to-one close air combat.

Language: Python | Stargazers: 0 | Issues: 0

Diff4RLSurvey

This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Reinforcement Learning: A Survey"

License: Apache-2.0 | Stargazers: 0 | Issues: 0
Language: Python | License: MIT | Stargazers: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0

generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

License: Apache-2.0 | Stargazers: 0 | Issues: 0

gym-marl-reconnaissance

Gym environment for cooperative multi-agent reinforcement learning in heterogeneous robot teams

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

License: Apache-2.0 | Stargazers: 0 | Issues: 0
Stargazers: 0 | Issues: 0

Malib

A parallel framework for population-based multi-agent reinforcement learning.

License: MIT | Stargazers: 0 | Issues: 0

MAPDN

An open-source environment for multi-agent active voltage control on power distribution networks (MAPDN).

License: MIT | Stargazers: 0 | Issues: 0

NAC

NeurIPS-2021: Neural Auto-Curricula in Two-Player Zero-Sum Games.

Stargazers: 0 | Issues: 0

nash-dqn

Official code for Nash-DQN, an algorithm for two-player zero-sum Markov games. For details, see the paper "A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games" by Zihan Ding, Dijia Su, Qinghua Liu, and Chi Jin.

Stargazers: 0 | Issues: 0

NXDO

Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games

License: MIT | Stargazers: 0 | Issues: 0

Online-dt

Online Decision Transformer

License: NOASSERTION | Stargazers: 0 | Issues: 0

Open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

PGSIM

PGSIM Simulator

Stargazers: 0 | Issues: 0

Pipeline-PSRO

Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games

License: MIT | Stargazers: 0 | Issues: 0

poker-bot

An OpenAI gym environment and RL agent for Texas hold 'em Poker

Stargazers: 0 | Issues: 0

PowerGridworld

PowerGridworld provides users with a lightweight, modular, and customizable framework for creating power-systems-focused, multi-agent Gym environments that readily integrate with existing training frameworks for reinforcement learning (RL). https://arxiv.org/abs/2111.05969

License: BSD-3-Clause | Stargazers: 0 | Issues: 0

PSRO_BD_RD

Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games

Stargazers: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0

SciencePlots

Matplotlib styles for scientific plotting

License: MIT | Stargazers: 0 | Issues: 0

Stratego_Env

Multi-Agent RL Environment for the Stratego Board Game (and variants)

License: MIT | Stargazers: 0 | Issues: 0