Yixuan Huang's repositories
Points2Plans
Official implementation of Points2Plans: From Point Clouds to Long-Horizon Plans with Composable Relational Dynamics
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
bullet3
Mainly focus is the racecar in the pybullet. Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.
cpo
Constrained Policy Optimization
cs294-112_hws
My solution to assignments in UC Berkeley CS294-112: Deep Reinforcement Learning
gps
Guided Policy Search
gym
A toolkit for developing and comparing reinforcement learning algorithms.
handful-of-trials
Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
handful-of-trials-pytorch
Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
mirage-rl-trpo
Fork of https://github.com/ikostrikov/pytorch-trpo with modifications for the paper "The Mirage of Action-Dependent Baselines in Reinforcement Learning".
models
Models and examples built with TensorFlow
object_collections
Code to accompany our CoRL 2019 paper
ppo
Proximal Policy Optimization implementation with TensorFlow
pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
PyTorch-Tutorial
Build your neural network easy and fast
Reinforcement-learning-with-tensorflow
Simple Reinforcement learning tutorials
rllab
rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.
sornet
Code for SORNet: Spatial Object-Centric Representations for Sequential Manipulation in CoRL 2021 (Best Systems Paper Finalist)
trex-gym
OpenAI Gym environment using pybullet for a Tyrannosaur.
trpo
Trust Region Policy Optimization with TensorFlow and OpenAI Gym
TRPO-TensorFlow
Trust Region Policy Optimization (TRPO) in pure TensorFlow