AndyYue1893

AndyYue1893's starred repositories

simple-pid

A simple and easy to use PID controller in Python

Language:PythonMIT72300

gym-jsbsim

A reinforcement learning environment for aircraft control using the JSBSim flight dynamics model

Language:PythonMIT15900

gym-jsbsim-f16

A reinforcement learning environment for aircraft control using the JSBSim flight dynamics model

MIT300

CloseAirCombat

An environment based on JSBSIM aimed at one-to-one close air combat.

Language:PythonGPL-3.022400

CloseAirCombat_baseline

An environment based on JSBSIM aimed at one-to-one close air combat.

Language:Python800

jsbsim

An open source flight dynamics & control software library

Language:C++LGPL-2.1126000

MARLlib

One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)

Language:PythonMIT81400

MARLlib

This code base enables multi-agent RL in the RLlib

400

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Language:PythonMIT828800

llama2.c

Inference Llama 2 in one file of pure C

Language:CMIT1667800

gym-pybullet-drones

PyBullet Gymnasium environments for single and multi-agent reinforcement learning of quadcopter control

Language:PythonMIT113200

flightmare

An Open Flexible Quadrotor Simulator

Language:C++NOASSERTION92900

COVID-19-SEIR-LSTM

本项目实现2019新型冠状病毒肺炎预测，分别采用经典传染病动力学模型SEIR和LSTM神经网络实现，通过控制模型参数来改变干预程度，体现防控的意义。

Language:Python10300

LLM-Optimizers-Papers

Must-read Papers on Large Language Model (LLM) as Optimizers and Automatic Optimization for Prompting LLMs.

18800

safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Language:PythonApache-2.0121600

TIT_open_source

The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"

Language:Python5000

demo of multi-agent reinforcement learning algorithms, such as ATT-MADDPG (Modelling the Dynamic Joint Policy of Teammates with Attention Multi-Agent DDPG) and NCC-MARL (Neighborhood Cognition Consistent Multi-Agent Reinforcement Learning).

Language:Python3300

attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Language:PythonMIT857800

JARVIS

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Language:PythonMIT2329000

Open-Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Language:PythonApache-2.03683900

ChatGLM-Tuning

基于ChatGLM-6B + LoRA的Fintune方案

Language:PythonMIT370000

LLM-with-RL-papers

A collection of LLM with RL papers

20200

PromptCraft-Robotics

Community for applying LLMs to robotics and a robot simulator with ChatGPT integration

Language:PythonMIT176400

omnisafe

OmniSafe is an infrastructural framework for accelerating SafeRL research.

Language:PythonApache-2.087200

safety-gymnasium

NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark

Language:PythonApache-2.034000

HEBO

Bayesian optimisation & Reinforcement Learning library developped by Huawei Noah's Ark Lab

Language:Jupyter Notebook303200

Deep-Learning-Papers-Reading-Roadmap

Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!

Language:Python3768000

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Language:PythonMIT351300

awesome-rl

Reinforcement learning resources curated

869900

reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

Language:PythonMIT1329700