Ilya Kostrikov (ikostrikov)

ikostrikov

User data from Github https://github.com/ikostrikov

Company:UC Berkeley

Location:Berkeley

Home Page:www.kostrikov.xyz

GitHub:@ikostrikov

Twitter:@ikostrikov


Organizations
VisualComputingInstitute

Ilya Kostrikov's repositories

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Language:PythonLicense:MITStargazers:3827Issues:64Issues:230

pytorch-a3c

PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".

Language:PythonLicense:MITStargazers:1260Issues:41Issues:68

jaxrl

JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.

Language:Jupyter NotebookLicense:MITStargazers:672Issues:12Issues:8

pytorch-flows

PyTorch implementations of algorithms for density estimation

Language:PythonLicense:MITStargazers:581Issues:17Issues:8

pytorch-trpo

PyTorch implementation of Trust Region Policy Optimization

Language:PythonLicense:MITStargazers:442Issues:12Issues:20

pytorch-meta-optimizer

A PyTorch implementation of Learning to learn by gradient descent by gradient descent

Language:PythonLicense:MITStargazers:313Issues:14Issues:10

pytorch-ddpg-naf

Implementation of algorithms for continuous control (DDPG and NAF).

Language:PythonLicense:MITStargazers:308Issues:8Issues:9
Language:PythonLicense:MITStargazers:271Issues:4Issues:7

TensorFlow-Pointer-Networks

TensorFlow implementation of Pointer Networks

Language:PythonLicense:MITStargazers:203Issues:11Issues:10
Language:Jupyter NotebookLicense:MITStargazers:47Issues:4Issues:2
Language:PythonLicense:MITStargazers:23Issues:2Issues:1

linenplus

Flax extensions.

Language:PythonLicense:MITStargazers:5Issues:5Issues:0
Language:PythonLicense:Apache-2.0Stargazers:3Issues:2Issues:1

gym

A toolkit for developing and comparing reinforcement learning algorithms.

Language:PythonLicense:NOASSERTIONStargazers:3Issues:2Issues:0

motion_imitation

Code accompanying the paper "Learning Agile Robotic Locomotion Skills by Imitating Animals"

Language:PythonLicense:Apache-2.0Stargazers:2Issues:1Issues:0
Language:PythonLicense:GPL-3.0Stargazers:1Issues:1Issues:0

Implicit-Q-Learning

PyTorch implementation of the implicit Q-learning algorithm (IQL)

Language:PythonStargazers:1Issues:1Issues:0

mazelab

A customizable framework to create maze and gridworld environments

Language:PythonStargazers:1Issues:1Issues:0

Mine_tf2.0

MINE: Mutual Information Neural Estimation in pytorch

Language:Jupyter NotebookStargazers:1Issues:4Issues:0

roboverse

A set of environments utilizing pybullet for simulation of robotic manipulation tasks.

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

unitree_sim

MuJoCo models for Unitree Robots

d4rl

A benchmark for offline reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

gym-wordle

Gym environment for playing Wordle with RL agents

Language:PythonStargazers:0Issues:1Issues:0

oatomobile

A research framework for autonomous driving

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:2Issues:0

SMAAC

This repo contains the code of "Winning the L2RPN Challenge: Power Grid Management via Semi-Markov Afterstate Actor-Critic".

Language:PythonLicense:MPL-2.0Stargazers:0Issues:2Issues:0