baba888888's starred repositories

Language: Python | License: NOASSERTION | Stargazers: 140 | Issues: 0

Hierarchical-DQN

Implementation of the paper Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation - https://arxiv.org/pdf/1604.06057.pdf

Language: Python | Stargazers: 81 | Issues: 0

Actor-Sharer-Learner

Actor-Sharer-Learner training framework for off-policy DRL algorithms

Language: Python | License: MIT | Stargazers: 19 | Issues: 0

ros_motion_planning

Motion planning and navigation for AGV/AMR: ROS planner plugin implementations of A*, JPS, D*, LPA*, D* Lite, Theta*, RRT, RRT*, RRT-Connect, Informed RRT*, ACO, PSO, Voronoi, PID, LQR, MPC, DWA, APF, Pure Pursuit, etc.

Language: C++ | License: GPL-3.0 | Stargazers: 2264 | Issues: 0

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Language: Python | License: MIT | Stargazers: 3604 | Issues: 0

pomdp-py

A framework to build and solve POMDP problems. Documentation: https://h2r.github.io/pomdp-py/

Language: Python | License: MIT | Stargazers: 223 | Issues: 0

pymote2.0

Wireless sensor network simulation

Language: Python | License: NOASSERTION | Stargazers: 22 | Issues: 0

AirSim

Open source simulator for autonomous vehicles built on Unreal Engine / Unity, from Microsoft AI & Research

Language: C++ | License: NOASSERTION | Stargazers: 16470 | Issues: 0
Stargazers: 2 | Issues: 0
Language: Python | Stargazers: 1 | Issues: 0
Language: Python | Stargazers: 2 | Issues: 0
Language: Java | Stargazers: 2 | Issues: 0

Charging-Sensors-Network-Optimization

A bi-level optimized charging algorithm for energy depletion avoidance in wireless rechargeable sensor networks

Language: Python | Stargazers: 4 | Issues: 0

wsn

A Wireless Sensor Network simulator in Python and C++ (via SWIG).

Language: Python | Stargazers: 74 | Issues: 0

DRL-and-graph-neural-network-for-routing-problems

This is the official code for the published paper 'Solve routing problems with a residual edge-graph attention neural network'

Language: Python | License: MIT | Stargazers: 192 | Issues: 0
Language: Python | Stargazers: 4 | Issues: 0

trl

Train transformer language models with reinforcement learning.

Language: Python | License: Apache-2.0 | Stargazers: 10063 | Issues: 0

PPOxFamily

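PPO x Family DRL Tutorial Course (an introductory open course on decision intelligence: eight lessons that help you sort out the algorithm theory, untangle the code logic, and put decision AI into practice)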

Language: Python | License: Apache-2.0 | Stargazers: 1974 | Issues: 0

CARSM

Code for Critic-ARSM policy gradient algorithm

Language: Python | Stargazers: 4 | Issues: 0
Language: C++ | License: Apache-2.0 | Stargazers: 38 | Issues: 0

hybrid-action-RL

Hybrid action space reinforcement learning algorithms.

Language: Python | Stargazers: 12 | Issues: 0

jPPO-ConvNTM

[INFOCOM 2020] Energy-Efficient UAV Crowdsensing with Multiple Charging Stations by Deep Learning

Language: Python | Stargazers: 14 | Issues: 0

ObstacleAvoidanceForUAVs

Obstacle avoidance in UAVs with reinforcement learning (PPO)

Language: Python | Stargazers: 7 | Issues: 0

DRL-code-pytorch

Concise PyTorch implementations of DRL algorithms, including REINFORCE, A2C, DQN, PPO (discrete and continuous), DDPG, TD3, and SAC.

Language: Python | License: MIT | Stargazers: 1084 | Issues: 0

Dispatching-rules-for-FJSP

This is the official code for the baseline methods of the published paper 'A Multi-action Deep Reinforcement Learning Framework for Flexible Job-shop Scheduling Problem'

Language: Python | License: MIT | Stargazers: 79 | Issues: 0

DRL-MTSP

Reimplementation of the paper "A reinforcement learning approach for optimizing multiple traveling salesman problems over graphs"

Language: Python | Stargazers: 53 | Issues: 0

obstacle-tower-agent

Reinforcement learning tackling the challenges of third-person navigation in a sparse 3D environment

Language: Python | Stargazers: 4 | Issues: 0

SAC

Multi-Discrete Soft Actor Critic implementation on Unity's procedurally generated Obstacle Tower Environment.

Language: Python | Stargazers: 8 | Issues: 0

gpt_academic

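Provides a practical interactive interface for LLMs such as GPT and GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other codebases; PDF/LaTeX paper translation and summarization; parallel queries to multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, moss, and others.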

Language: Python | License: GPL-3.0 | Stargazers: 65704 | Issues: 0
Language: Ruby | License: MIT | Stargazers: 1 | Issues: 0