Rosemary Ke (nke001)

Rosemary Ke's repositories

causal_learning_unknown_interventions

Code for "Neural causal learning from unknown interventions"

sparse_attentive_backtracking_release

Code for our paper "Sparse Attentive Backtracking: Temporal Credit Assignment Through Reminding": https://papers.nips.cc/paper/7991-sparse-attentive-backtracking-temporal-credit-assignment-through-reminding.pdf

Language: Python | License: NOASSERTION | Stargazers: 36 | Issues: 8 | Issues: 1

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language: Python | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0

c-swm-1

Contrastive Learning of Structured World Models

Language: Python | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0

causal_induction

Codebase for "Causal Induction from Visual Observations for Goal-Directed Tasks"

Stargazers: 0 | Issues: 0 | Issues: 0

coinrun

Code for the paper "Quantifying Transfer in Reinforcement Learning"

Language: C++ | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

doodad

A job launching library for docker, EC2, etc.

Language: Python | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

GitPython

GitPython is a Python library used to interact with Git repositories.

Language: Python | License: BSD-3-Clause | Stargazers: 0 | Issues: 2 | Issues: 0

guided-evolutionary-strategies

Guided Evolutionary Strategies

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

gym-minigrid

Minimalistic gridworld environment for OpenAI Gym

Language: Python | License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

mrcl

Code for the NeurIPS19 paper "Meta-Learning Representations for Continual Learning"

Language: Python | Stargazers: 0 | Issues: 2 | Issues: 0

mujoco-py

MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 2 | Issues: 0

Language: HTML | Stargazers: 0 | Issues: 0 | Issues: 0

omniglot

Omniglot data set for one-shot learning

Language: MATLAB | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0

pytorch-a2c-ppo

A recurrent, multi-process, and readable PyTorch implementation of the deep reinforcement learning algorithms A2C and PPO

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

pytorch-a2c-ppo-acktr

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), and the scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR).

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

pytorch-maml

PyTorch implementation of MAML: https://arxiv.org/abs/1703.03400

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

pytorch-noreward-rl

PyTorch implementation of Curiosity-driven Exploration by Self-supervised Prediction

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

pytorch-trpo

PyTorch implementation of Trust Region Policy Optimization

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Pytorch-UNet

PyTorch implementation of the U-Net for image semantic segmentation with high quality images

License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

pytorchrl

Deep Reinforcement Learning algorithms implemented in PyTorch

Language: Python | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0

retro

Retro Games in Gym

Language: C++ | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0

retro-baselines

Publicly releasable baselines for the Retro contest

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

rlkit

Collection of reinforcement learning algorithms

Language: Python | Stargazers: 0 | Issues: 3 | Issues: 0

rllab

rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 2 | Issues: 0

Sectar

Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings

Language: Python | Stargazers: 0 | Issues: 2 | Issues: 0

spriteworld

Spriteworld: a flexible, configurable Python-based reinforcement learning environment

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 2 | Issues: 0

STOVE

Anonymous ICLR 2020 Submission: Structured Object-Aware Physics Prediction for Video Modelling and Planning

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

WorldModelsExperiments

World Models Experiments

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0