H.B. Jiang (SigmaBM)

Company: Peking University

Location: China

H.B. Jiang's repositories

robosumo-selfplay

Reproduction of the self-play training described in the paper "Emergent Complexity via Multi-Agent Competition", adapted from the PPO2 implementation in OpenAI Baselines.

Language: Python | Stargazers: 5 | Issues: 0

MACE

Code for "Settling Decentralized Multi-Agent Coordinated Exploration by Novelty Sharing", accepted at AAAI 2024.

Language: Python | Stargazers: 2 | Issues: 0

neurips2020-flatland-starter-kit

Forked from https://gitlab.aicrowd.com/flatland/neurips2020-flatland-starter-kit.git

Language: Jupyter Notebook | License: MIT | Stargazers: 1 | Issues: 0

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

batch-ppo

Efficient Batched Reinforcement Learning in TensorFlow

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

CompilerProject-2020Spring

Course Project. PKU Compiler Design. Spring, 2020.

Language: C++ | License: MIT | Stargazers: 0 | Issues: 0

CS294_Fall-2017_HW

Assignments for CS294-112 Fall 2017

Language: Python | Stargazers: 0 | Issues: 0

CS294_Fall-2018_HW

Assignments for CS294-112 Fall 2018

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

hbjiang.github.io

Freeloading on GitHub's free HTTPS 🤣

Language: JavaScript | Stargazers: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0

lihang-code

Code implementations for the book "Statistical Learning Methods" (《统计学习方法》)

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0

meta-mapg-code

Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning"

Language: Python | Stargazers: 0 | Issues: 0

MineDojo

Building Open-Ended Embodied Agents with Internet-Scale Knowledge

Language: Java | License: MIT | Stargazers: 0 | Issues: 0

Minigrid

Simple and easily configurable grid world environments for reinforcement learning

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0

nd889

Udacity Artificial Intelligence Nanodegree

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0

openbilibili-go-common

Source code of the bilibili (哔哩哔哩) website backend

Language: Go | Stargazers: 0 | Issues: 0

pomegranate

Fast, flexible and easy to use probabilistic modelling in Python.

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0

robosumo

Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments"

Language: Python | Stargazers: 0 | Issues: 0

spinningup

An educational resource to help anyone learn deep reinforcement learning.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

StarCraft

Implementations of QMIX, VDN, COMA, QTRAN, CommNet, DyMA-CL, G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II

Stargazers: 0 | Issues: 0