simsimi's repositories
BCQ
Author's PyTorch implementation of BCQ for continuous and discrete actions
Language: Python | License: MIT | 0 | 0 | 0
Language: Python | 0 | 0 | 0
dm_control
DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
Language: Python | License: Apache-2.0 | 0 | 0 | 0
Language: Python | 0 | 0 | 0
Language: Python | 0 | 0 | 0
Language: C | 0 | 0 | 0
machine_learning
without dataset
Language: Jupyter Notebook | 0 | 0 | 0
Language: HTML | 0 | 0 | 0
Language: C | 0 | 0 | 0
Language: Python | 0 | 0 | 0
UPDeT
Official implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' (ICLR 2021 spotlight)
License: MIT | 0 | 0 | 0