weidler / RLaSpa

Reinforcement Learning in Latent Space

weidler/RLaSpa Issues

Create template for experiments in Wiki
Closed 5 years ago
Double Check the Workings of all Networks
Closed 6 years ago
Include MultiTask Possibility into Framework
Closed 6 years ago
Prepare job script for Aachen cluster
Closed 6 years ago
Create new Tasks
Closed 6 years ago1
Make tensorboard logs get their own directories
Closed 6 years ago
Use 4 frames of Race/Evasion as one state
Updated 6 years ago
Check how DQN memory influence training when using ckpts, eventually save memory in ckpts as well.
Closed 6 years ago
Add agent.save() and agent.load()
Closed 6 years ago
Create different tasks/obstacle maps to "transfer"
Closed 6 years ago3
Develop a better metric for the progress report
Closed 6 years ago5
Verify if GPU is indeed utilized
Closed 6 years ago
Exploration strategy
Closed 6 years ago
Create statistics of training time and performance
Closed 6 years ago2
Allow the DQN Policy to backpropagate through the representation module
Closed 6 years ago
Have rendered testruns automatically saved to a gif file
Closed 6 years ago
Make Race/Evasion tasks customizable (kwargs for the register)
Closed 6 years ago
Tensorize everything
Closed 6 years ago
Check if input is as expected and throw meaningful errors.
Updated 6 years ago1
Document Code, include type hinting!
Updated 6 years ago1
Look at the heads of the Janus and Cerberus architecture
Closed 6 years ago
Batch Learning in Representation Learners
Closed 6 years ago
Being able to save and continue training a model
Closed 6 years ago4
Make tasks implement the gym interface
Closed 6 years ago1
Modular Framework for different RL Approaches
Closed 6 years ago4
Learn ObstaclePathing in latent space
Closed 6 years ago
Try to learn ObstaclePathing
Closed 6 years ago6
Try to learn Pathing in latent space
Closed 6 years ago
Test RL agent on (padded) two-task latent representation
Closed 6 years ago
Implement SiameseAutoencoder
Closed 6 years ago1
Make pathing task use numpy arrays for faster conversions
Closed 6 years ago3
Create DQN framework
Closed 6 years ago2
Implement
Closed 6 years ago
qwdqwd
Closed 6 years ago