AndyYue1893's starred repositories

simple-pid

A simple and easy to use PID controller in Python

Language:PythonLicense:MITStargazers:723Issues:0Issues:0

gym-jsbsim

A reinforcement learning environment for aircraft control using the JSBSim flight dynamics model

Language:PythonLicense:MITStargazers:159Issues:0Issues:0

gym-jsbsim-f16

A reinforcement learning environment for aircraft control using the JSBSim flight dynamics model

License:MITStargazers:3Issues:0Issues:0

CloseAirCombat

An environment based on JSBSIM aimed at one-to-one close air combat.

Language:PythonLicense:GPL-3.0Stargazers:224Issues:0Issues:0

CloseAirCombat_baseline

An environment based on JSBSIM aimed at one-to-one close air combat.

Language:PythonStargazers:8Issues:0Issues:0

jsbsim

An open source flight dynamics & control software library

Language:C++License:LGPL-2.1Stargazers:1260Issues:0Issues:0

MARLlib

One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)

Language:PythonLicense:MITStargazers:814Issues:0Issues:0

MARLlib

This code base enables multi-agent RL in the RLlib

Stargazers:4Issues:0Issues:0

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Language:PythonLicense:MITStargazers:8288Issues:0Issues:0

llama2.c

Inference Llama 2 in one file of pure C

Language:CLicense:MITStargazers:16678Issues:0Issues:0

gym-pybullet-drones

PyBullet Gymnasium environments for single and multi-agent reinforcement learning of quadcopter control

Language:PythonLicense:MITStargazers:1132Issues:0Issues:0

flightmare

An Open Flexible Quadrotor Simulator

Language:C++License:NOASSERTIONStargazers:929Issues:0Issues:0

COVID-19-SEIR-LSTM

本项目实现2019新型冠状病毒肺炎预测,分别采用经典传染病动力学模型SEIR和LSTM神经网络实现,通过控制模型参数来改变干预程度,体现防控的意义。

Language:PythonStargazers:103Issues:0Issues:0

LLM-Optimizers-Papers

Must-read Papers on Large Language Model (LLM) as Optimizers and Automatic Optimization for Prompting LLMs.

Stargazers:188Issues:0Issues:0

safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Language:PythonLicense:Apache-2.0Stargazers:1216Issues:0Issues:0

TIT_open_source

The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"

Language:PythonStargazers:50Issues:0Issues:0

marl_demo

demo of multi-agent reinforcement learning algorithms, such as ATT-MADDPG (Modelling the Dynamic Joint Policy of Teammates with Attention Multi-Agent DDPG) and NCC-MARL (Neighborhood Cognition Consistent Multi-Agent Reinforcement Learning).

Language:PythonStargazers:33Issues:0Issues:0

attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Language:PythonLicense:MITStargazers:8578Issues:0Issues:0

JARVIS

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Language:PythonLicense:MITStargazers:23290Issues:0Issues:0

Open-Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Language:PythonLicense:Apache-2.0Stargazers:36839Issues:0Issues:0

ChatGLM-Tuning

基于ChatGLM-6B + LoRA的Fintune方案

Language:PythonLicense:MITStargazers:3700Issues:0Issues:0

LLM-with-RL-papers

A collection of LLM with RL papers

Stargazers:202Issues:0Issues:0

PromptCraft-Robotics

Community for applying LLMs to robotics and a robot simulator with ChatGPT integration

Language:PythonLicense:MITStargazers:1764Issues:0Issues:0

omnisafe

OmniSafe is an infrastructural framework for accelerating SafeRL research.

Language:PythonLicense:Apache-2.0Stargazers:872Issues:0Issues:0

safety-gymnasium

NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark

Language:PythonLicense:Apache-2.0Stargazers:340Issues:0Issues:0

HEBO

Bayesian optimisation & Reinforcement Learning library developped by Huawei Noah's Ark Lab

Language:Jupyter NotebookStargazers:3032Issues:0Issues:0

Deep-Learning-Papers-Reading-Roadmap

Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!

Language:PythonStargazers:37680Issues:0Issues:0

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Language:PythonLicense:MITStargazers:3513Issues:0Issues:0

awesome-rl

Reinforcement learning resources curated

Stargazers:8699Issues:0Issues:0

reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

Language:PythonLicense:MITStargazers:13297Issues:0Issues:0