baba888888's starred repositories
Hierarchical-DQN
Implementation of the paper Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation - https://arxiv.org/pdf/1604.06057.pdf
Actor-Sharer-Learner
Actor-Sharer-Learner training framework for off-policy DRL algorithms
ros_motion_planning
Motion planning and Navigation of AGV/AMR:ROS planner plugin implementation of A*, JPS, D*, LPA*, D* Lite, Theta*, RRT, RRT*, RRT-Connect, Informed RRT*, ACO, PSO, Voronoi, PID, LQR, MPC, DWA, APF, Pure Pursuit etc.
pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Charging-Sensors-Network-Optimization
A bi-level optimized charging algorithm for energy depletion avoidance in wireless rechargeable sensor networks
DRL-and-graph-neural-network-for-routing-problems
This is the official code for the published paper 'Solve routing problems with a residual edge-graph attention neural network'
PPOxFamily
PPO x Family DRL Tutorial Course(决策智能入门级公开课:8节课帮你盘清算法理论,理顺代码逻辑,玩转决策AI应用实践 )
hybrid-action-RL
Hybrid action space reinforcement learning algorithms.
jPPO-ConvNTM
[INFOCOM 2020] Energy-Efficient UAV Crowdsensing with Multiple Charging Stations by Deep Learning
ObstacleAvoidanceForUAVs
Obstacle avoidance in UAVs with reinforcement learning (PPO)
DRL-code-pytorch
Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, SAC.
Dispatching-rules-for-FJSP
This is the official code for the baseline methods of the publised paper 'A Multi-action Deep Reinforcement Learning Framework for Flexible Job-shop Scheduling Problem'
obstacle-tower-agent
Reinforcement learning tackling challenges of third-person navigation in sparse 3D environment
gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。