Giters
weidler
/
RLaSpa
Reinforcement Learning in Latent Space
Geek Repo:
Geek Repo
Github PK Tool:
Github PK Tool
Stargazers:
5
Watchers:
5
Issues:
34
Forks:
1
weidler/RLaSpa Issues
Create template for experiments in Wiki
Closed
5 years ago
Double Check the Workings of all Networks
Closed
6 years ago
Include MultiTask Possibility into Framework
Closed
6 years ago
Prepare job script for Aachen cluster
Closed
6 years ago
Create new Tasks
Closed
6 years ago
Comments count
1
Make tensorboard logs get their own directories
Closed
6 years ago
Use 4 frames of Race/Evasion as one state
Updated
6 years ago
Check how DQN memory influence training when using ckpts, eventually save memory in ckpts as well.
Closed
6 years ago
Add agent.save() and agent.load()
Closed
6 years ago
Create different tasks/obstacle maps to "transfer"
Closed
6 years ago
Comments count
3
Develop a better metric for the progress report
Closed
6 years ago
Comments count
5
Verify if GPU is indeed utilized
Closed
6 years ago
Exploration strategy
Closed
6 years ago
Create statistics of training time and performance
Closed
6 years ago
Comments count
2
Allow the DQN Policy to backpropagate through the representation module
Closed
6 years ago
Have rendered testruns automatically saved to a gif file
Closed
6 years ago
Make Race/Evasion tasks customizable (kwargs for the register)
Closed
6 years ago
Tensorize everything
Closed
6 years ago
Check if input is as expected and throw meaningful errors.
Updated
6 years ago
Comments count
1
Document Code, include type hinting!
Updated
6 years ago
Comments count
1
Look at the heads of the Janus and Cerberus architecture
Closed
6 years ago
Batch Learning in Representation Learners
Closed
6 years ago
Being able to save and continue training a model
Closed
6 years ago
Comments count
4
Make tasks implement the gym interface
Closed
6 years ago
Comments count
1
Modular Framework for different RL Approaches
Closed
6 years ago
Comments count
4
Learn ObstaclePathing in latent space
Closed
6 years ago
Try to learn ObstaclePathing
Closed
6 years ago
Comments count
6
Try to learn Pathing in latent space
Closed
6 years ago
Test RL agent on (padded) two-task latent representation
Closed
6 years ago
Implement SiameseAutoencoder
Closed
6 years ago
Comments count
1
Make pathing task use numpy arrays for faster conversions
Closed
6 years ago
Comments count
3
Create DQN framework
Closed
6 years ago
Comments count
2
Implement
Closed
6 years ago
qwdqwd
Closed
6 years ago