tsmir's starred repositories

long_range_models

Simple implementations of long-range sequence models (LRU, S5, S4, and more).

Language: Python, License: MIT, Stargazers: 4, Issues: 0

la-mbda

LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization.

Language: Python, License: MIT, Stargazers: 31, Issues: 0

gym-safety

Simple gym environments for safety in reinforcement learning research.

Language: Python, Stargazers: 15, Issues: 0