zhuanghb3

zhuanghb3's starred repositories

LeetCodeAnimation

Demonstrate all the questions on LeetCode in the form of animation. (Presents the reasoning behind LeetCode solutions as animations.)

TensorFlow-Examples

TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 43374 | Issues: 2047 | Issues: 233

gym

A toolkit for developing and comparing reinforcement learning algorithms.

Language: Python | License: NOASSERTION | Stargazers: 34559 | Issues: 1058 | Issues: 1825

reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

Language: Python | License: MIT | Stargazers: 13505 | Issues: 555 | Issues: 99

tutorials

PyTorch tutorials.

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 8128 | Issues: 180 | Issues: 795

Data-Science-Notes

Data science notes and collected resources

Language: Jupyter Notebook | Stargazers: 8116 | Issues: 237 | Issues: 15

Informer2020

The GitHub repository for the paper "Informer" accepted by AAAI 2021.

Language: Python | License: Apache-2.0 | Stargazers: 5293 | Issues: 37 | Issues: 580

algorithms

Algorithms & Data structures in C++.

Language: C++ | License: MIT | Stargazers: 5242 | Issues: 371 | Issues: 25

HighwayEnv

A minimalist environment for decision-making in autonomous driving

Language: Python | License: MIT | Stargazers: 2580 | Issues: 29 | Issues: 462

awesome-reinforcement-learning-zh

Reinforcement learning resources compiled in Chinese

MARL-Algorithms

Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II

SMARTS

Scalable Multi-Agent RL Training School for Autonomous Driving

Language: Python | License: MIT | Stargazers: 931 | Issues: 13 | Issues: 1008

Data-Structure-And-Algorithm

Data Structure And Algorithm (C/C++ implementations of common data structures and algorithms)

learn-cpp

Codecademy | Learn C++

pytorch-DRL

PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.

Language: Python | License: MIT | Stargazers: 524 | Issues: 12 | Issues: 7

Modeling-and-Simulation-of-MATLAB-Simulink-Communication-System

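Source code for the book 详解MATLAB/Simulink通信系统建模与仿真 (Detailed MATLAB/Simulink Communication System Modeling and Simulation), by Liu Xueyong (刘学勇)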

MARL_CAVs

MARL for Autonomous Vehicles

reinforcement_learning_financial_trading

MATLAB example on how to use Reinforcement Learning for developing a financial trading model

Language: MATLAB | License: NOASSERTION | Stargazers: 151 | Issues: 17 | Issues: 0

Reinforcement-learning-Algorithms-and-Dynamic-Programming

Reinforcement learning algorithms such as SARSA, Q-learning, Actor-Critic Policy Gradient, and Value Function Approximation were applied to stabilize an inverted pendulum system and achieve optimal control, establishing the concept of Reinforcement Learning Controllers. The RL controllers are compared with one another on performance and efficiency, and each is separately compared with the classical Linear Quadratic Regulator controller. Each RL controller has been integrated with a swing-up controller. A virtual switch toggles automatically between the swing-up controller and the RL controller, based on the angular deviation theta with respect to the vertical plane. My research paper and my undergraduate thesis have been uploaded for reference, along with all the code.

Language: MATLAB | Stargazers: 100 | Issues: 7 | Issues: 0

reinforcement-learning

Implementation of Single-Agent and Multi-Agent Reinforcement Learning Algorithms. MATLAB.

Language: Matlab | Stargazers: 57 | Issues: 1 | Issues: 0

Reinforcement-Learning-An-introduction

solutions to the examples and exercises

Language: Matlab | Stargazers: 42 | Issues: 4 | Issues: 0

Multi_Agent_Soft_Actor_Critic

A Pytorch Implementation of Multi Agent Soft Actor Critic

Language: Jupyter Notebook | Stargazers: 34 | Issues: 2 | Issues: 1

multiagent-sac

Implementation of centralized training, centralized execution of Soft Actor-Critic (SAC) on a Tennis multiagent Unity environment.

Language: Python | License: MIT | Stargazers: 31 | Issues: 1 | Issues: 1

Paper-List-of-MARL

A new paper list for multi-agent reinforcement learning (actively maintained)

DRL-baseline

Reproduces DQN, REINFORCE, REINFORCE with baseline, one-step AC, QAC, QAC with shared network, PPO2, DDPG, TD3, SAC, SAC discrete, A2C, and A3C

Language: Python | Stargazers: 18 | Issues: 1 | Issues: 0

MultiAgentLearning

Multi-agent learning library

Robot-Cluster-Control

Cluster robot with Matlab

Language: MATLAB | Stargazers: 7 | Issues: 1 | Issues: 0

Advanced-Soft-Actor-Critic

Soft Actor-Critic with advanced features

Language: Python | Stargazers: 2 | Issues: 1 | Issues: 0

RVSS2019-WS

Repo for the workshop part of the Australian Centre for Robotic Vision Summer School RVSS2019

Language: Python | Stargazers: 1 | Issues: 2 | Issues: 0