taodav

David Tao's repositories

nsrs

Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.

Language:Jupyter NotebookMIT1300

lstm-contextual-decomposition

Reproducing "Beyond Word Importance: Contextual Decomposition to Extract Interactions from LSTMs"

Language:Jupyter NotebookMIT500

aux-inputs

reinforcement learning with auxiliary inputs

Language:Jupyter NotebookMIT100

generalization-rl

Code for our CMPUT 607 project, based on the paper "Protecting Against Evaluation Overfitting in Empirical Reinforcement Learning"

Language:PythonMIT100

TextWorldACG

Scripts for generating the TextWorldACG dataset (https://arxiv.org/abs/1812.00855)

Language:PythonMIT100

balloon-learning-environment

The Balloon Learning Environment - flying stratospheric balloons with deep reinforcement learning.

Language:Jupyter NotebookApache-2.0000

bandits

Just some stuff on bandits

Language:Jupyter Notebook000

COMP421-D3-Q2

stuff

Language:Java000

cs-useful-things

Useful things that I've accumulated as an undergrad/grad student studying Computer Science.

000

GANs

PyTorch implementations of GAN models.

Language:PythonMIT000

grl

Language:Python000

jelly-bean-world

A framework for experimenting with never-ending learning

Language:C++Apache-2.0000

kobuddy

Kobo database backup and parser: extracts notes, highlights, reading progress and more

Language:PythonMIT000

mc

minecraft server for the frens

Language:Shell000

MCTS

Monte Carlo Tree Search for Q-value approximation

Language:Python000

meta-learning

Implementations of meta-learning algorithms in TensorFlow. For use in one-shot facial recognition.

Language:PythonMIT000

MuZero

A structured implementation of MuZero

Language:Python000

onager

Lightweight python library for launching experiments and tuning hyperparameters, either locally or on a cluster

Language:PythonMIT000

personal-site

My personal website - built with React, React-Router, React-Snap for Static-Export, and GitHub Pages.

Language:SCSSMIT000

Pruning-NNs

Implementing neural network pruning

Language:Jupyter NotebookMIT000

qmk_firmware

Open-source keyboard firmware for Atmel AVR and Arm USB families

Language:CGPL-2.0000

Rainbow

Rainbow: Combining Improvements in Deep Reinforcement Learning

Language:PythonMIT000

rewardpredictive

Language:Jupyter NotebookMIT000

rl-competition

Repository for the 2009 RL Competition codebase

Language:Java000

RL-Coursera

Implementations of Coursera Reinforcement Learning Specialization

MIT000

rlpyt

Reinforcement Learning in PyTorch

Language:PythonMIT000

Sketch-RNN

Pytorch (again) implementation of sketch-rnn.

Language:PythonMIT000

slack-bixi-bot

A small slack bot to check the status of a given bixi status

Language:PythonMIT000

stuff

Language:Shell000

WorldModels

Reproducing/Extending the World Models paper (https://worldmodels.github.io/)

Language:PythonMIT000