wulihan20212021

wulihan20212021

Geek Repo

Github PK Tool:Github PK Tool

wulihan20212021's starred repositories

ChuanhuChatGPT

GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.

Language:PythonLicense:GPL-3.0Stargazers:15144Issues:0Issues:0

StateAdvDRL

[NeurIPS 2020, Spotlight] Code for "Robust Deep Reinforcement Learning against Adversarial Perturbations on Observations"

Stargazers:110Issues:0Issues:0

rebel

An algorithm that generalizes the paradigm of self-play reinforcement learning and search to imperfect-information games.

Language:C++License:Apache-2.0Stargazers:647Issues:0Issues:0

tarok

:spades: Slovenian Tarok card game environment for the OpenSpiel framework.

Language:C++License:MITStargazers:10Issues:0Issues:0

Nash-DQN

Deep Reinforcement Learning for Nash Equilibria

Language:Jupyter NotebookStargazers:39Issues:0Issues:0

Quickest-Detection-FDI-Remote-Estimation

Code for our paper titled "Quickest detection of false data injection in remote state estimation" published at IEEE ISIT 2021.

Language:Jupyter NotebookStargazers:6Issues:0Issues:0

Soft-Actor-Critic-Reinforcement-Learning-Mobile-Robot-Navigation

This example uses Soft Actor Critic(SAC) based reinforcement learning to develop the mobile robot navigation. For a brief summary of the SAC algorithm, see Soft Actor Critic(SAC) Agents. This example scenario trains a mobile robot to navigate from location A to location B to avoid obstacles given range sensor readings that detect obstacles in the map. The objective of the reinforcement learning algorithm is to learn what controls (linear and angular velocity) for navigation from an initial to goal position and during the travel also can avoid colliding into obstacles. This example uses an occupancy map of a known environment to generate range sensor readings, detect obstacles, and check collisions the robot may make. The range sensor readings are the observations for the SAC agent, and the linear and angular velocity controls are the action.

Language:MATLABStargazers:11Issues:0Issues:0

Reinforcement_learning-PID-auto-tuning

Auto tuning of PID parameters of a quad-rotor using Q-learning

Language:MATLABStargazers:18Issues:0Issues:0

rl-agent-based-traffic-control

Develop agent-based traffic management system by model-free reinforcement learning

Language:MATLABLicense:NOASSERTIONStargazers:44Issues:0Issues:0

swarm_evolve

Model-Based Stochastic Search for Large Scale Optimization of Multi-Agent UAV Swarms

Language:C++License:MITStargazers:15Issues:0Issues:0
Language:MATLABStargazers:7Issues:0Issues:0

Adv-MARL

Adversarial attacks in consensus-based multi-agent reinforcement learning

Language:PythonStargazers:16Issues:0Issues:0

NetworkSecRLwithPareto

Network Security Attack and Defence Strategy selection using Reinforcement Learning and Pareto efficiency

Language:MatlabStargazers:5Issues:0Issues:0

SDN_DDoS_Simulation

An attempt to detect and prevent DDoS attacks using reinforcement learning. The simulation was done using Mininet.

Language:PythonLicense:MITStargazers:105Issues:0Issues:0
Language:PythonStargazers:12Issues:0Issues:0
Language:MATLABStargazers:8Issues:0Issues:0

MARL-Papers

Paper list of multi-agent reinforcement learning (MARL)

Stargazers:3967Issues:0Issues:0