Ilya Kostrikov (ikostrikov)

ikostrikov

Geek Repo

Company:UC Berkeley

Location:Berkeley

Home Page:www.kostrikov.xyz

Twitter:@ikostrikov

Github PK Tool:Github PK Tool


Organizations
VisualComputingInstitute

Ilya Kostrikov's repositories

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Language:PythonLicense:MITStargazers:3489Issues:68Issues:229

pytorch-a3c

PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".

Language:PythonLicense:MITStargazers:1195Issues:44Issues:67

jaxrl

JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.

Language:Jupyter NotebookLicense:MITStargazers:585Issues:12Issues:8

pytorch-flows

PyTorch implementations of algorithms for density estimation

Language:PythonLicense:MITStargazers:568Issues:19Issues:8

pytorch-trpo

PyTorch implementation of Trust Region Policy Optimization

Language:PythonLicense:MITStargazers:415Issues:13Issues:20

pytorch-meta-optimizer

A PyTorch implementation of Learning to learn by gradient descent by gradient descent

Language:PythonLicense:MITStargazers:309Issues:16Issues:10

pytorch-ddpg-naf

Implementation of algorithms for continuous control (DDPG and NAF).

Language:PythonLicense:MITStargazers:304Issues:10Issues:9

TensorFlow-Pointer-Networks

TensorFlow implementation of Pointer Networks

Language:PythonLicense:MITStargazers:205Issues:12Issues:10
Language:PythonLicense:MITStargazers:178Issues:4Issues:6
Language:Jupyter NotebookLicense:MITStargazers:39Issues:5Issues:2
Language:PythonLicense:MITStargazers:23Issues:3Issues:1

linenplus

Flax extensions.

Language:PythonLicense:MITStargazers:5Issues:6Issues:0

gym

A toolkit for developing and comparing reinforcement learning algorithms.

Language:PythonLicense:NOASSERTIONStargazers:4Issues:2Issues:0
Language:PythonLicense:Apache-2.0Stargazers:3Issues:3Issues:1

Mine_tf2.0

MINE: Mutual Information Neural Estimation in pytorch

Language:Jupyter NotebookStargazers:2Issues:4Issues:0

motion_imitation

Code accompanying the paper "Learning Agile Robotic Locomotion Skills by Imitating Animals"

Language:PythonLicense:Apache-2.0Stargazers:2Issues:2Issues:0
Language:PythonLicense:GPL-3.0Stargazers:1Issues:2Issues:0

Implicit-Q-Learning

PyTorch implementation of the implicit Q-learning algorithm (IQL)

Language:PythonStargazers:1Issues:2Issues:0

mazelab

A customizable framework to create maze and gridworld environments

Language:PythonStargazers:1Issues:2Issues:0

roboverse

A set of environments utilizing pybullet for simulation of robotic manipulation tasks.

Language:PythonLicense:MITStargazers:1Issues:2Issues:0

unitree_sim

MuJoCo models for Unitree Robots

d4rl

A benchmark for offline reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

gym-wordle

Gym environment for playing Wordle with RL agents

Language:PythonStargazers:0Issues:2Issues:0

oatomobile

A research framework for autonomous driving

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0
Language:PythonStargazers:0Issues:2Issues:0

SMAAC

This repo contains the code of "Winning the L2RPN Challenge: Power Grid Management via Semi-Markov Afterstate Actor-Critic".

Language:PythonLicense:MPL-2.0Stargazers:0Issues:2Issues:0