ikostrikov

followers

following

stars

UC Berkeley

Berkeley

www.kostrikov.xyz

Organizations

VisualComputingInstitute

Ilya Kostrikov's repositories

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Language:PythonMIT3489 68 229

pytorch-a3c

PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".

Language:PythonMIT1195 44 67

jaxrl

JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.

Language:Jupyter NotebookMIT585 12 8

pytorch-flows

PyTorch implementations of algorithms for density estimation

Language:PythonMIT568 19 8

pytorch-trpo

PyTorch implementation of Trust Region Policy Optimization

Language:PythonMIT415 13 20

pytorch-meta-optimizer

A PyTorch implementation of Learning to learn by gradient descent by gradient descent

Language:PythonMIT309 16 10

pytorch-ddpg-naf

Implementation of algorithms for continuous control (DDPG and NAF).

Language:PythonMIT304 10 9

walk_in_the_park

Language:PythonMIT235 12 5

implicit_q_learning

Language:PythonMIT210 5 9

TensorFlow-Pointer-Networks

TensorFlow implementation of Pointer Networks

Language:PythonMIT205 12 10

rlpd

Language:PythonMIT178 4 6

pytorch-rl

jaxrl2

Language:Jupyter NotebookMIT39 5 2

dmcgym

Language:PythonMIT23 3 1

linenplus

Flax extensions.

Language:PythonMIT5 60

gail-experts

MIT4 4 1

gym

A toolkit for developing and comparing reinforcement learning algorithms.

Language:PythonNOASSERTION4 20

cql-results

Language:PythonApache-2.03 3 1

Mine_tf2.0

MINE: Mutual Information Neural Estimation in pytorch

Language:Jupyter Notebook2 40

motion_imitation

Code accompanying the paper "Learning Agile Robotic Locomotion Skills by Imitating Animals"

Language:PythonApache-2.02 20

doodad

Language:PythonGPL-3.01 20

Implicit-Q-Learning

PyTorch implementation of the implicit Q-learning algorithm (IQL)

Language:Python1 20

mazelab

A customizable framework to create maze and gridworld environments

Language:Python1 20

roboverse

A set of environments utilizing pybullet for simulation of robotic manipulation tasks.

Language:PythonMIT1 20

unitree_sim

MuJoCo models for Unitree Robots

1 20

d4rl

A benchmark for offline reinforcement learning.

Language:PythonApache-2.0020

gym-wordle

Gym environment for playing Wordle with RL agents

Language:Python020

oatomobile

A research framework for autonomous driving

Language:PythonApache-2.0020

obj_2_mujoco_msh

Language:Python020

SMAAC

This repo contains the code of "Winning the L2RPN Challenge: Power Grid Management via Semi-Markov Afterstate Actor-Critic".

Language:PythonMPL-2.0020