lzyyy58

lzyyy58

Geek Repo

0

followers

0

following

Github PK Tool:Github PK Tool

lzyyy58's starred repositories

imitation_learning

PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.

Language:PythonStargazers:131Issues:0Issues:0

DDPGfD

DDPGfD: This is our implementation project for the Reinforcement Learning course in NCTU.

Language:PythonStargazers:28Issues:0Issues:0

robomimic

robomimic: A Modular Framework for Robot Learning from Demonstration

Language:PythonLicense:MITStargazers:520Issues:0Issues:0

Rofunc

🤖 The Full Process Python Package for Robot Learning from Demonstration and Robot Manipulation

Language:PythonLicense:Apache-2.0Stargazers:406Issues:0Issues:0

bdq_sb

Fork of stable baselines, provides implementation of BDQ algorithm https://arxiv.org/abs/1711.08946

Language:PythonLicense:MITStargazers:8Issues:0Issues:0

Confidence-Aware-Imitation-Learning

Official implementation of the NeurIPS 2021 paper: S Zhang, Z Cao, D Sadigh, Y Sui: "Confidence-Aware Imitation Learning from Demonstrations with Varying Optimality"

Language:PythonLicense:MITStargazers:7Issues:0Issues:0

ChatPaper

Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复

Language:PythonLicense:NOASSERTIONStargazers:17983Issues:0Issues:0

RHER

The Code for Paper “Relay Hindsight Experience Replay: Self-Guided Continual Reinforcement Learning for Sequential Object Manipulation Tasks with Sparse Rewards”

Language:PythonLicense:MITStargazers:134Issues:0Issues:0

rlkit-relational

Codebase for ICRA 2020 paper "Towards Practical Multi-object Manipulation using Relational Reinforcement Learning"

Language:PythonLicense:MITStargazers:97Issues:0Issues:0

Student-Information-Management-System

学生信息管理系统 JAVA Mysql 数据库课程设计 简单界面

Language:JavaLicense:EPL-2.0Stargazers:134Issues:0Issues:0

Student-dormitory-management-system

学生宿舍管理系统(GUI):使用maven进行项目构建管理,使用javaFX和JFoenix设置界面,使用mysql数据库,业务流程使用mybatis加spring

Language:JavaStargazers:36Issues:0Issues:0

StudentAchievementManagementSystem

Java+SQLServer学生成绩管理系统(代码+数据库)

Language:JavaStargazers:172Issues:0Issues:0

Non-Local-NN-Pytorch

PyTorch implementation of Non-Local Neural Networks (https://arxiv.org/pdf/1711.07971.pdf)

Language:PythonLicense:MITStargazers:238Issues:0Issues:0

interact

Implementations of deep reinforcement learning algorithms.

Language:PythonLicense:MITStargazers:3Issues:0Issues:0

reinforcement_learning_phasic_policy_gradient

Deep Reinforcement Learning by using Phasic Policy Gradient in Pytorch & Tensorflow

Language:PythonLicense:GPL-3.0Stargazers:19Issues:0Issues:0

alf

Agent Learning Framework https://alf.readthedocs.io

Language:PythonLicense:Apache-2.0Stargazers:292Issues:0Issues:0
Language:PythonStargazers:28Issues:0Issues:0

DRL-code-pytorch

Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, SAC.

Language:PythonLicense:MITStargazers:971Issues:0Issues:0

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language:PythonLicense:NOASSERTIONStargazers:4952Issues:0Issues:0

phasic-policy-gradient

Code for the paper "Phasic Policy Gradient"

Language:PythonLicense:MITStargazers:244Issues:0Issues:0

stable-baselines

A fork of OpenAI Baselines, implementations of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:4100Issues:0Issues:0

DRLib

DRLib:a Concise Deep Reinforcement Learning Library, Integrating HER, PER and D2SR for Almost Off-Policy RL Algorithms.

Language:PythonLicense:MITStargazers:500Issues:0Issues:0

RL-Projects-SK

Reinforcement Learning Projects

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:16Issues:0Issues:0

keras-self-attention

Attention mechanism for processing sequential data that considers the context for each timestamp.

Language:PythonLicense:MITStargazers:652Issues:0Issues:0

kuka_rl

Reinforcement Learning Experiments using PyBullet

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:110Issues:0Issues:0

tensor2robot

Distributed machine learning infrastructure for large-scale robotics research

Language:PythonLicense:Apache-2.0Stargazers:534Issues:0Issues:0

Popular-RL-Algorithms

PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1039Issues:0Issues:0

QT_Opt

Q-network with cross-entropy (CE) method for reinforcement learning.

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:45Issues:0Issues:0

ravens

Train robotic agents to learn pick and place with deep learning for vision-based manipulation in PyBullet. Transporter Nets, CoRL 2020.

Language:PythonLicense:Apache-2.0Stargazers:543Issues:0Issues:0
Language:PythonStargazers:3Issues:0Issues:0