Huili Chen (huilichen25)

Huili Chen's starred repositories

rm-cooperative-marl

Code exploring the use of reward machines in the context of cooperative multi-agent reinforcement learning.

Language: Python | License: MIT | Stargazers: 13 | Issues: 0

apomdp

POMDP-based decision-making technique for Social Robots using ROS, Python and Julia

Language: Julia | License: GPL-3.0 | Stargazers: 4 | Issues: 0

stochastic-reward-machines

Code for our AAAI-22 paper "Reinforcement Learning with Stochastic Reward Machines"

Language: C++ | Stargazers: 3 | Issues: 0

tugger-routing

Case study on reinforcement learning for intralogistics

Language: Python | Stargazers: 11 | Issues: 0

pyAFM

Additive factors model and additive factors model with slip, implemented in Python

Language: Python | License: MIT | Stargazers: 18 | Issues: 0

Reimplement_RewardMachine

A reimplementation of the paper "Using Reward Machines for High-Level Task Specification and Decomposition in Reinforcement Learning"

Language: Python | Stargazers: 2 | Issues: 0

gym-subgoal-automata

Environments from the papers "Using Reward Machines for High-Level Task Specification and Decomposition in Reinforcement Learning" and "Induction and Exploitation of Subgoal Automata for Reinforcement Learning", built on the OpenAI Gym API.

Language: Python | License: MIT | Stargazers: 9 | Issues: 0

deep-clustering

A TensorFlow implementation of "Deep Clustering: Discriminative Embeddings for Segmentation and Separation"

Language: Python | Stargazers: 135 | Issues: 0

DeWave

Single-channel blind source separation

Language: Python | Stargazers: 48 | Issues: 0

deep-clustering

Deep clustering method for single-channel speech separation

Language: Python | Stargazers: 109 | Issues: 0

contextualbandits

Python implementations of contextual bandit algorithms

Language: Python | License: BSD-2-Clause | Stargazers: 726 | Issues: 0