Beast code in Giters

lzyyy58's starred repositories

imitation_learning

PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.

Language:Python13100

DDPGfD

DDPGfD: This is our implementation project for the Reinforcement Learning course in NCTU.

Language:Python2800

robomimic

robomimic: A Modular Framework for Robot Learning from Demonstration

Language:PythonMIT52000

Rofunc

🤖 The Full Process Python Package for Robot Learning from Demonstration and Robot Manipulation

Language:PythonApache-2.040600

bdq_sb

Fork of stable baselines, provides implementation of BDQ algorithm https://arxiv.org/abs/1711.08946

Language:PythonMIT800

Confidence-Aware-Imitation-Learning

Official implementation of the NeurIPS 2021 paper: S Zhang, Z Cao, D Sadigh, Y Sui: "Confidence-Aware Imitation Learning from Demonstrations with Varying Optimality"

Language:PythonMIT700

ChatPaper

Use ChatGPT to summarize the arXiv papers. 全流程加速科研，利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复

Language:PythonNOASSERTION1798300

RHER

The Code for Paper “Relay Hindsight Experience Replay: Self-Guided Continual Reinforcement Learning for Sequential Object Manipulation Tasks with Sparse Rewards”

Language:PythonMIT13400

rlkit-relational

Codebase for ICRA 2020 paper "Towards Practical Multi-object Manipulation using Relational Reinforcement Learning"

Language:PythonMIT9700

Student-Information-Management-System

学生信息管理系统 JAVA Mysql 数据库课程设计简单界面

Language:JavaEPL-2.013400

Student-dormitory-management-system

学生宿舍管理系统（GUI）：使用maven进行项目构建管理，使用javaFX和JFoenix设置界面，使用mysql数据库，业务流程使用mybatis加spring

Language:Java3600

StudentAchievementManagementSystem

Java+SQLServer学生成绩管理系统（代码+数据库）

Language:Java17200

Non-Local-NN-Pytorch

PyTorch implementation of Non-Local Neural Networks (https://arxiv.org/pdf/1711.07971.pdf)

Language:PythonMIT23800

interact

Implementations of deep reinforcement learning algorithms.

Language:PythonMIT300

reinforcement_learning_phasic_policy_gradient

Deep Reinforcement Learning by using Phasic Policy Gradient in Pytorch & Tensorflow

Language:PythonGPL-3.01900

alf

Agent Learning Framework https://alf.readthedocs.io

Language:PythonApache-2.029200

TrulyPPO

Language:Python2800

DRL-code-pytorch

Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, SAC.

Language:PythonMIT97100

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language:PythonNOASSERTION495200

phasic-policy-gradient

Code for the paper "Phasic Policy Gradient"

Language:PythonMIT24400

stable-baselines

A fork of OpenAI Baselines, implementations of reinforcement learning algorithms

Language:PythonMIT410000

DRLib

DRLib：a Concise Deep Reinforcement Learning Library, Integrating HER, PER and D2SR for Almost Off-Policy RL Algorithms.

Language:PythonMIT50000

RL-Projects-SK

Reinforcement Learning Projects

Language:Jupyter NotebookGPL-3.01600

keras-self-attention

Attention mechanism for processing sequential data that considers the context for each timestamp.

Language:PythonMIT65200

kuka_rl

Reinforcement Learning Experiments using PyBullet

Language:Jupyter NotebookApache-2.011000

tensor2robot

Distributed machine learning infrastructure for large-scale robotics research

Language:PythonApache-2.053400

Popular-RL-Algorithms

PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..

Language:Jupyter NotebookApache-2.0103900

QT_Opt

Q-network with cross-entropy (CE) method for reinforcement learning.

Language:Jupyter NotebookBSD-3-Clause4500

ravens

Train robotic agents to learn pick and place with deep learning for vision-based manipulation in PyBullet. Transporter Nets, CoRL 2020.

Language:PythonApache-2.054300

qt-opt

Language:Python300