Rosemary Ke's repositories

causal_learning_unknown_interventions

Code for "Neural causal learning from unknown interventions"

Language: C | 98 | 7 | 2

sparse_attentive_backtracking_release

Code for our paper "Sparse Attentive Backtracking: Temporal Credit Assignment Through Reminding" (https://papers.nips.cc/paper/7991-sparse-attentive-backtracking-temporal-credit-assignment-through-reminding.pdf)

Language: Python | License: NOASSERTION | 36 | 8 | 1

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language: Python | License: MIT | 0 | 2 | 0

c-swm-1

Contrastive Learning of Structured World Models

Language: Python | License: MIT | 0 | 2 | 0

causal_induction

Codebase for "Causal Induction from Visual Observations for Goal-Directed Tasks"

0 | 0 | 0

coinrun

Code for the paper "Quantifying Transfer in Reinforcement Learning"

Language: C++ | License: MIT | 0 | 0 | 0

doodad

A job launching library for docker, EC2, etc.

Language: Python | License: MIT | 0 | 2 | 0

gated-path-planning-networks

Language: Python | 0 | 0 | 0

GitPython

GitPython is a python library used to interact with Git repositories.

Language: Python | License: BSD-3-Clause | 0 | 2 | 0

guided-evolutionary-strategies

Guided Evolutionary Strategies

Language: Jupyter Notebook | License: Apache-2.0 | 0 | 0 | 0

gym-minigrid

Minimalistic gridworld environment for OpenAI Gym

Language: Python | License: BSD-3-Clause | 0 | 0 | 0

mrcl

Code for the NeurIPS19 paper "Meta-Learning Representations for Continual Learning"

Language: Python | 0 | 2 | 0

mujoco-py

MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.

Language: Python | License: NOASSERTION | 0 | 2 | 0

nke001.github.io

Language: HTML | 0 | 0 | 0

omniglot

Omniglot data set for one-shot learning

Language: MATLAB | License: MIT | 0 | 2 | 0

pytorch-a2c-ppo

A recurrent, multi-process and readable PyTorch implementation of the deep reinforcement learning algorithms A2C and PPO

Language: Python | 0 | 0 | 0

pytorch-a2c-ppo-acktr

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR).

Language: Python | License: MIT | 0 | 0 | 0