lzyyy58's starred repositories
imitation_learning
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Confidence-Aware-Imitation-Learning
Official implementation of the NeurIPS 2021 paper: S Zhang, Z Cao, D Sadigh, Y Sui: "Confidence-Aware Imitation Learning from Demonstrations with Varying Optimality"
rlkit-relational
Codebase for ICRA 2020 paper "Towards Practical Multi-object Manipulation using Relational Reinforcement Learning"
Student-Information-Management-System
Student information management system: Java, MySQL, database course project, simple interface
Student-dormitory-management-system
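Student dormitory management system (GUI): uses Maven for project build management, JavaFX and JFoenix for the interface, a MySQL database, and MyBatis plus Spring for the business logic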
StudentAchievementManagementSystem
Java + SQL Server student grade management system (code + database)
Non-Local-NN-Pytorch
PyTorch implementation of Non-Local Neural Networks (https://arxiv.org/pdf/1711.07971.pdf)
reinforcement_learning_phasic_policy_gradient
Deep reinforcement learning using Phasic Policy Gradient in PyTorch and TensorFlow
DRL-code-pytorch
Concise PyTorch implementations of DRL algorithms, including REINFORCE, A2C, DQN, PPO (discrete and continuous), DDPG, TD3, SAC.
phasic-policy-gradient
Code for the paper "Phasic Policy Gradient"
stable-baselines
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
RL-Projects-SK
Reinforcement Learning Projects
keras-self-attention
Attention mechanism for processing sequential data that considers the context for each timestamp.
tensor2robot
Distributed machine learning infrastructure for large-scale robotics research
Popular-RL-Algorithms
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..