tspooner / rsrl

A fast, safe and easy to use reinforcement learning framework in Rust.

Home Page:https://crates.io/crates/rsrl

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Implicit compatible function approximation

tspooner opened this issue · comments

Find a neat solution to using the policy score function as the features of an LFA instance. The issue at the moment is that the project method only takes a single input. The score function variant would also require the action. There are loads of ways to do this, but we want something that won't require rethinking later down the line.