cvoelcker / vagram_quadratic

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Higher-order Taylor approximations for Value-gradient Aware Model Learning

This is the codebase for our course project for CSC 2541 Winter 2022 taught by Prof Roger Grosse.

To replicate the visualization presented in the paper, use the visualization.py script.

To replicate the RL experiments, use the provided sbatch shell script and the hyperparameters listed in the rl_experiments.sh script.

About


Languages

Language:Python 94.9%Language:Shell 5.1%