Jianhong Wang (hsvgbkhgbv)

hsvgbkhgbv

Geek Repo

Company:University of Manchester

Location:Manchester

Home Page:hsvgbkhgbv.github.io

Twitter:@hsvgbkhgbv

Github PK Tool:Github PK Tool


Organizations
Future-Power-Networks

Jianhong Wang's repositories

SQDDPG

This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled by ''Shapley Q-value: A Local Reward Approach to Solve Global Reward Games''.

shapley-q-learning

This repo is the implementation of paper ''SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning''.

Language:PythonStargazers:37Issues:2Issues:0

Snore-Sound-Classification-by-Deep-Learning

This is the implementation for the paper: A CNN-GRU approach to capture time-frequency pattern interdependence for snore sound classification.

Language:PythonStargazers:18Issues:2Issues:0

PyPSA

PyPSA: Python for Power System Analysis

Language:PythonLicense:MITStargazers:2Issues:0Issues:0

pypownet

A power network simulator with a Reinforcement Learning-focused usage.

Language:PythonLicense:LGPL-3.0Stargazers:1Issues:2Issues:0

coma

Convolutional Mesh Autoencoders for Generating 3D Faces

License:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0

ConvLab

DSTC8 Track 1 Task 1 End-to-End Multi-Domain Dialog Challenge Result:

Language:PythonLicense:MITStargazers:0Issues:2Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

gym

A toolkit for developing and comparing reinforcement learning algorithms.

License:NOASSERTIONStargazers:0Issues:0Issues:0

hsvgbkhgbv.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0

JaxMARL

Multi-Agent Reinforcement Learning with JAX

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

lb-foraging

Level-Based Foraging (LBF): A multi-agent reinforcement learning environment

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

MADRaS

Multi-Agent DRiving Simulator

Language:PythonLicense:AGPL-3.0Stargazers:0Issues:1Issues:0
Stargazers:0Issues:0Issues:0

multi-agent-emergence-environments

Environment generation code for the paper "Emergent Tool Use From Multi-Agent Autocurricula"

Language:PythonLicense:MITStargazers:0Issues:2Issues:0
License:MITStargazers:0Issues:0Issues:0

multiwoz

Source code for end-to-end dialogue model from the MultiWOZ paper (Budzianowski et al. 2018, EMNLP)

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

NeuralDialog-LaRL

PyTorch implementation of latent space reinforcement learning for E2E dialog published at NAACL 2019. It is released by Tiancheng Zhao (Tony) from Dialog Research Center, LTI, CMU

License:Apache-2.0Stargazers:0Issues:0Issues:0

plato-research-dialogue-system

This is the Plato Research Dialogue System, a flexible platform for developing conversational AI agents.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

poppy

:hibiscus: Population-Based Reinforcement Learning for Combinatorial Optimization

License:Apache-2.0Stargazers:0Issues:0Issues:0

PyBoy

Game Boy emulator written in Python

License:NOASSERTIONStargazers:0Issues:0Issues:0

pymarl

Python Multi-Agent Reinforcement Learning framework

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

random-network-distillation-pytorch

Random Network Distillation pytorch

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

rlcard

Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

safety-gym

Tools for accelerating safe exploration research.

License:MITStargazers:0Issues:0Issues:0

TextWorld

​TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

License:NOASSERTIONStargazers:0Issues:0Issues:0

use_vim_as_ide

use vim as IDE

License:CC0-1.0Stargazers:0Issues:0Issues:0

wqmix

Code for Weighted QMIX

Stargazers:0Issues:0Issues:0