Beast code in Giters

H.B. Jiang's repositories

robosumo-selfplay

Reproduction of self-play described in paper "Emergent Complexity via Multi-Agent Competition", adapted from PPO2 implementation in OpenAI baselines.

Language:Python500

MACE

Code for "Settling Decentralized Multi-Agent Coordinated Exploration by Novelty Sharing" accepted by AAAI 2024.

Language:Python200

neurips2020-flatland-starter-kit

Forked from https://gitlab.aicrowd.com/flatland/neurips2020-flatland-starter-kit.git

Language:Jupyter NotebookMIT100

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonMIT000

batch-ppo

Efficient Batched Reinforcement Learning in TensorFlow

Language:PythonApache-2.0000

CompilerProject-2020Spring

Course Project. PKU Compiler Design. Spring, 2020.

Language:C++MIT000

CS294_Fall-2017_HW

Assignments for CS294-112 Fall 2017

Language:Python000

CS294_Fall-2018_HW

Assignments for CS294-112 Fall 2018

Language:PythonMIT000

hbjiang.github.io

白嫖一下github的https🤣

Language:JavaScript000

infer-policy-feature

Language:Python000

lihang-code

《统计学习方法》的代码实现

Language:Jupyter Notebook000

meta-mapg-code

Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning"

Language:Python000

MineDojo

Building Open-Ended Embodied Agents with Internet-Scale Knowledge

Language:JavaMIT000

Minigrid

Simple and easily configurable grid world environments for reinforcement learning

Language:PythonNOASSERTION000

nd889

Udacity Artificial Intelligence Nanodegree

Language:Jupyter NotebookMIT000

openbilibili-go-common

哔哩哔哩 bilibili 网站后台工程源码

Language:Go000

pomegranate

Fast, flexible and easy to use probabilistic modelling in Python.

Language:Jupyter NotebookMIT000

robosumo

Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments"

Language:Python000

spinningup

An educational resource to help anyone learn deep reinforcement learning.

Language:PythonMIT000

StarCraft

Implementations of QMIX, VDN, COMA, QTRAN, CommNet, DyMA-CL, G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II

000