ikostrikov

User data from GitHub: https://github.com/ikostrikov

UC Berkeley

Berkeley

www.kostrikov.xyz

Organizations

VisualComputingInstitute

Ilya Kostrikov's repositories

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Language: Python · License: MIT · 3827 64 230

pytorch-a3c

PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".

Language: Python · License: MIT · 1260 41 68

jaxrl

JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.

Language: Jupyter Notebook · License: MIT · 672 12 8

pytorch-flows

PyTorch implementations of algorithms for density estimation

Language: Python · License: MIT · 581 17 8

pytorch-trpo

PyTorch implementation of Trust Region Policy Optimization

Language: Python · License: MIT · 442 12 20

pytorch-meta-optimizer

A PyTorch implementation of "Learning to learn by gradient descent by gradient descent"

Language: Python · License: MIT · 313 14 10

pytorch-ddpg-naf

Implementation of algorithms for continuous control (DDPG and NAF).

Language: Python · License: MIT · 308 8 9

rlpd

Language: Python · License: MIT · 271 4 7

implicit_q_learning

Language: Python · License: MIT · 261 4 9

walk_in_the_park

Language: Python · License: MIT · 258 10 6

TensorFlow-Pointer-Networks

TensorFlow implementation of Pointer Networks

Language: Python · License: MIT · 203 11 10

pytorch-rl

jaxrl2

Language: Jupyter Notebook · License: MIT · 47 4 2

dmcgym

Language: Python · License: MIT · 23 2 1

linenplus

Flax extensions.

Language: Python · License: MIT · 5 50

cql-results

Language: Python · License: Apache-2.0 · 3 2 1

gail-experts

License: MIT · 3 3 1

gym

A toolkit for developing and comparing reinforcement learning algorithms.

Language: Python · License: NOASSERTION · 3 20

motion_imitation

Code accompanying the paper "Learning Agile Robotic Locomotion Skills by Imitating Animals"

Language: Python · License: Apache-2.0 · 2 10

doodad

Language: Python · License: GPL-3.0 · 1 10

Implicit-Q-Learning

PyTorch implementation of the implicit Q-learning algorithm (IQL)

Language: Python · 1 10

mazelab

A customizable framework to create maze and gridworld environments

Language: Python · 1 10

Mine_tf2.0

MINE: Mutual Information Neural Estimation in pytorch

Language: Jupyter Notebook · 1 40

roboverse

A set of environments utilizing pybullet for simulation of robotic manipulation tasks.

Language: Python · License: MIT · 1 10

unitree_sim

MuJoCo models for Unitree Robots

1 20

d4rl

A benchmark for offline reinforcement learning.

Language: Python · License: Apache-2.0 · 020

gym-wordle

Gym environment for playing Wordle with RL agents

Language: Python · 010

oatomobile

A research framework for autonomous driving

Language: Python · License: Apache-2.0 · 010

obj_2_mujoco_msh

Language: Python · 020

SMAAC

This repo contains the code of "Winning the L2RPN Challenge: Power Grid Management via Semi-Markov Afterstate Actor-Critic".

Language: Python · License: MPL-2.0 · 020