hsvgbkhgbv

Jianhong Wang's repositories

SQDDPG

This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled by ''Shapley Q-value: A Local Reward Approach to Solve Global Reward Games''.

Language:Python106 4 10

shapley-q-learning

This repo is the implementation of paper ''SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning''.

Language:Python37 20

Snore-Sound-Classification-by-Deep-Learning

This is the implementation for the paper: A CNN-GRU approach to capture time-frequency pattern interdependence for snore sound classification.

Language:Python18 20

PyPSA

PyPSA: Python for Power System Analysis

Language:PythonMIT200

pypownet

A power network simulator with a Reinforcement Learning-focused usage.

Language:PythonLGPL-3.01 20

coma

Convolutional Mesh Autoencoders for Generating 3D Faces

NOASSERTION000

comix

Language:Python010

ConvLab

DSTC8 Track 1 Task 1 End-to-End Multi-Domain Dialog Challenge Result:

Language:PythonMIT020

dcg

Apache-2.0000

gym

A toolkit for developing and comparing reinforcement learning algorithms.

NOASSERTION000

hsvgbkhgbv.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language:JavaScriptMIT000

JaxMARL

Multi-Agent Reinforcement Learning with JAX

Language:PythonApache-2.0000

larl_trial

Language:Python000

lb-foraging

Level-Based Foraging (LBF): A multi-agent reinforcement learning environment

Language:PythonMIT000

MADRaS

Multi-Agent DRiving Simulator

Language:PythonAGPL-3.0010

master_thesis

000

multi-agent-emergence-environments

Environment generation code for the paper "Emergent Tool Use From Multi-Agent Autocurricula"

Language:PythonMIT020

multiagent-particle-envs

MIT000

multiwoz

Source code for end-to-end dialogue model from the MultiWOZ paper (Budzianowski et al. 2018, EMNLP)

Language:PythonMIT020

NeuralDialog-LaRL

PyTorch implementation of latent space reinforcement learning for E2E dialog published at NAACL 2019. It is released by Tiancheng Zhao (Tony) from Dialog Research Center, LTI, CMU

Apache-2.0000