wulihan20212021

wulihan20212021's starred repositories

ChuanhuChatGPT

GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.

Language:PythonGPL-3.01514400

StateAdvDRL

[NeurIPS 2020, Spotlight] Code for "Robust Deep Reinforcement Learning against Adversarial Perturbations on Observations"

11000

rebel

An algorithm that generalizes the paradigm of self-play reinforcement learning and search to imperfect-information games.

Language:C++Apache-2.064700

tarok

:spades: Slovenian Tarok card game environment for the OpenSpiel framework.

Language:C++MIT1000

Reinforcement_Learning_In_Two_Player_Simultaneous_Action_Games

Language:Jupyter Notebook300

Nash-DQN

Deep Reinforcement Learning for Nash Equilibria

Language:Jupyter Notebook3900

Quickest-Detection-FDI-Remote-Estimation

Code for our paper titled "Quickest detection of false data injection in remote state estimation" published at IEEE ISIT 2021.

Language:Jupyter Notebook600

Soft-Actor-Critic-Reinforcement-Learning-Mobile-Robot-Navigation

This example uses Soft Actor Critic(SAC) based reinforcement learning to develop the mobile robot navigation. For a brief summary of the SAC algorithm, see Soft Actor Critic(SAC) Agents. This example scenario trains a mobile robot to navigate from location A to location B to avoid obstacles given range sensor readings that detect obstacles in the map. The objective of the reinforcement learning algorithm is to learn what controls (linear and angular velocity) for navigation from an initial to goal position and during the travel also can avoid colliding into obstacles. This example uses an occupancy map of a known environment to generate range sensor readings, detect obstacles, and check collisions the robot may make. The range sensor readings are the observations for the SAC agent, and the linear and angular velocity controls are the action.

Language:MATLAB1100