David Tao (taodav)

taodav

Geek Repo

Company:Brown University

Location:Providence

Home Page:taodav.cc

Github PK Tool:Github PK Tool

David Tao's repositories

nsrs

Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.

Language:Jupyter NotebookLicense:MITStargazers:13Issues:0Issues:0

lstm-contextual-decomposition

Reproducing "Beyond Word Importance: Contextual Decomposition to Extract Interactions from LSTMs"

Language:Jupyter NotebookLicense:MITStargazers:5Issues:0Issues:0

aux-inputs

reinforcement learning with auxiliary inputs

Language:Jupyter NotebookLicense:MITStargazers:1Issues:0Issues:0

generalization-rl

Code for our CMPUT 607 project, based on the paper "Protecting Against Evaluation Overfitting in Empirical Reinforcement Learning"

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

TextWorldACG

Scripts for generating the TextWorldACG dataset (https://arxiv.org/abs/1812.00855)

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

balloon-learning-environment

The Balloon Learning Environment - flying stratospheric balloons with deep reinforcement learning.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

bandits

Just some stuff on bandits

Language:Jupyter NotebookStargazers:0Issues:0Issues:0
Language:JavaStargazers:0Issues:0Issues:0

cs-useful-things

Useful things that I've accumulated as an undergrad/grad student studying Computer Science.

Stargazers:0Issues:0Issues:0

GANs

PyTorch implementations of GAN models.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

jelly-bean-world

A framework for experimenting with never-ending learning

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

kobuddy

Kobo database backup and parser: extracts notes, highlights, reading progress and more

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

mc

minecraft server for the frens

Language:ShellStargazers:0Issues:0Issues:0

MCTS

Monte Carlo Tree Search for Q-value approximation

Language:PythonStargazers:0Issues:0Issues:0

meta-learning

Implementations of meta-learning algorithms in TensorFlow. For use in one-shot facial recognition.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

MuZero

A structured implementation of MuZero

Language:PythonStargazers:0Issues:0Issues:0

onager

Lightweight python library for launching experiments and tuning hyperparameters, either locally or on a cluster

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

personal-site

My personal website - built with React, React-Router, React-Snap for Static-Export, and GitHub Pages.

Language:SCSSLicense:MITStargazers:0Issues:0Issues:0

Pruning-NNs

Implementing neural network pruning

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

qmk_firmware

Open-source keyboard firmware for Atmel AVR and Arm USB families

Language:CLicense:GPL-2.0Stargazers:0Issues:0Issues:0

Rainbow

Rainbow: Combining Improvements in Deep Reinforcement Learning

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

rl-competition

Repository for the 2009 RL Competition codebase

Language:JavaStargazers:0Issues:0Issues:0

RL-Coursera

Implementations of Coursera Reinforcement Learning Specialization

License:MITStargazers:0Issues:0Issues:0

rlpyt

Reinforcement Learning in PyTorch

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Sketch-RNN

Pytorch (again) implementation of sketch-rnn.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

slack-bixi-bot

A small slack bot to check the status of a given bixi status

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:ShellStargazers:0Issues:0Issues:0

WorldModels

Reproducing/Extending the World Models paper (https://worldmodels.github.io/)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0