yixuanhuang98

Yixuan Huang's repositories

Points2Plans

Official implementation of Points2Plans: From Point Clouds to Long-Horizon Plans with Composable Relational Dynamics

Language:PythonMIT2600

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonMIT000

bullet3

Mainly focus is the racecar in the pybullet. Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.

Language:C++NOASSERTION000

cpo

Constrained Policy Optimization

Language:Python000

cs294-112_hws

My solution to assignments in UC Berkeley CS294-112: Deep Reinforcement Learning

Language:PythonMIT000

experiments-with-neural-style-transfer

Language:Jupyter Notebook000

gps

Guided Policy Search

Language:PythonNOASSERTION000

gym

A toolkit for developing and comparing reinforcement learning algorithms.

Language:PythonNOASSERTION000

handful-of-trials

Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"

Language:PythonMIT000

handful-of-trials-pytorch

Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"

Language:Python000

mirage-rl-trpo

Fork of https://github.com/ikostrikov/pytorch-trpo with modifications for the paper "The Mirage of Action-Dependent Baselines in Reinforcement Learning".

Language:PythonMIT000

models

Models and examples built with TensorFlow

Language:PythonApache-2.0000

object_collections

Code to accompany our CoRL 2019 paper

000

ppo

Proximal Policy Optimization implementation with TensorFlow

Language:PythonMIT000

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Language:PythonMIT000

yixuanhuang98

Yixuan Huang's repositories

Points2Plans

baselines

bullet3

cpo

cs294-112_hws

experiments-with-neural-style-transfer

gps

gym

handful-of-trials

handful-of-trials-pytorch

mirage-rl-trpo

model-based-ppo

models

object_collections

ppo

pytorch-a2c-ppo-acktr-gail

PyTorch-Tutorial

Reinforcement-Learning

Reinforcement-learning-with-tensorflow

rllab

sornet

trex-gym

trpo

TRPO-TensorFlow

urdf_tutorial

yixuanhuang98.github.io