Alexander Nikulin's repositories

faster-trajectory-transformer

Implementation of Trajectory Transformer with attention caching and batched beam search

Language:PythonLicense:MITStargazers:99Issues:1Issues:7

prioritized_experience_replay

Prioritized Experience Replay implementation with proportional prioritization

Language:PythonLicense:MITStargazers:56Issues:2Issues:1

sac-n-jax

Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch

Language:PythonLicense:MITStargazers:43Issues:1Issues:1

evolution_strategies_openai

implementation of "Evolution Strategies as a Scalable Alternative to Reinforcement Learning" OpenAI paper

Language:PythonStargazers:13Issues:3Issues:0

link_pred_spark

similarity between graph nodes based on local information with PySpark

Language:PythonStargazers:9Issues:2Issues:0

chess_minimax

minimax algorithm for chess with alpha-beta pruning

Language:Jupyter NotebookLicense:MITStargazers:7Issues:2Issues:0

average_reward_ppo

Implementation of "Average-Reward Reinforcement Learning with Trust Region Methods" paper.

Language:PythonStargazers:6Issues:1Issues:0

MHRW

metropolis-hastings random walk with PySpark

Language:Jupyter NotebookStargazers:5Issues:2Issues:0

cic_gym

Adaptation of original "Contrastive Intrinsic Control for Unsupervised Skill Discovery" implementation to OpenAI Gym

halfcheetah_experts

expert policies for forward and backflip halfcheetah envs

Language:PythonStargazers:2Issues:1Issues:0

JaxMARL

Multi-Agent Reinforcement Learning with JAX

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

pgx

🎲 Vectorized RL game environments written in JAX with end-to-end AlphaZero examples

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

cic

CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery

Language:PythonStargazers:0Issues:0Issues:0

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

d4rl

A benchmark for offline reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:1Issues:0

hse_recsys

hse recommender systems course

Language:Jupyter NotebookStargazers:0Issues:1Issues:0

hse_reinforcement_learning

HSE Reinforcement Learning course

Language:Jupyter NotebookStargazers:0Issues:2Issues:0

linear-transformer-experiments

Experiments using fast linear transformer

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

link_pred

link prediction in social network based on node neighborhoods

Language:Jupyter NotebookStargazers:0Issues:1Issues:0

memory-maze

Evaluating long-term memory of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Minari

A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

mujoco-py

MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.

Language:CythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

robosuite

robosuite: A Modular Simulation Framework and Benchmark for Robot Learning

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

tomita-gym

[TBD] Environments based on Tomita Grammars

License:Apache-2.0Stargazers:0Issues:1Issues:0
Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

vector-quantize-pytorch

Vector Quantization, in Pytorch

Language:PythonLicense:MITStargazers:0Issues:0Issues:0