Martin Klissarov's repositories

PPOC

Proximal Policy Option-Critic

dceo

Learning diverse options through the Laplacian representation.

Language:PythonLicense:Apache-2.0Stargazers:20Issues:2Issues:3

phi_gcn

Reward Propagation using Graph Convolutional Networks

sagemaker_tutorial

Simple tutorials about SageMaker

Language:Jupyter NotebookStargazers:9Issues:1Issues:0

MOC

Flexible Option Learning

DAVF

Diffusion-Based Approximate Value Functions

Language:PythonStargazers:4Issues:1Issues:0

gym-extensions

This repo is intended as an extension for OpenAI Gym for auxiliary tasks (multitask learning, transfer learning, inverse reinforcement learning, etc.)

Language:PythonLicense:NOASSERTIONStargazers:2Issues:1Issues:0
Language:PythonLicense:Apache-2.0Stargazers:1Issues:2Issues:0
Language:PythonLicense:MITStargazers:1Issues:1Issues:0
Language:PythonLicense:NOASSERTIONStargazers:1Issues:1Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

deer

DEEp Reinforcement learning framework

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0
Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

dopamine

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:2Issues:0
Language:PythonStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:1Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0
Language:PythonStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:1Issues:0
Language:Jupyter NotebookStargazers:0Issues:4Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:3Issues:0
Language:HTMLStargazers:0Issues:2Issues:0

motif

Intrinsic Motivation from Artificial Intelligence Feedback

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

nle

The NetHack Learning Environment

Language:CLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0