Implicit compatible function approximation
tspooner opened this issue
Thomas Spooner commented
Find a neat solution for using the policy's score function as the features of an LFA instance. The issue at the moment is that the `project` method only takes a single input, whereas the score function variant would also require the action. There are many ways to do this, but we want something that won't need rethinking further down the line.
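One possible direction, sketched below in Python purely for illustration, is to keep `project` single-argument but make the projector generic over its input type, so that a score-function projector takes a `(state, action)` pair as its one input. Compatible function approximation then amounts to an LFA of the form f_w(s, a) = wᵀ ∇_θ log π_θ(a | s). Everything here is an assumption rather than the project's actual API: `Projector`, `SoftmaxPolicy`, `ScoreProjector` and `LFA` are hypothetical names, and a softmax policy with one weight vector per action is used only because its score function has a simple closed form.

```python
import math
from typing import Generic, List, Sequence, Tuple, TypeVar

I = TypeVar("I")  # input type accepted by a projector (hypothetical design)


class Projector(Generic[I]):
    """Hypothetical interface: maps a single input of type I to features."""

    def project(self, x: I) -> List[float]:
        raise NotImplementedError


class SoftmaxPolicy:
    """Softmax policy with one weight vector per action (an assumption)."""

    def __init__(self, theta: List[List[float]]):
        self.theta = theta  # theta[b] holds the weights for action b

    def probabilities(self, phi: Sequence[float]) -> List[float]:
        prefs = [sum(w * f for w, f in zip(row, phi)) for row in self.theta]
        m = max(prefs)  # subtract the max for numerical stability
        exps = [math.exp(p - m) for p in prefs]
        z = sum(exps)
        return [e / z for e in exps]

    def score(self, phi: Sequence[float], action: int) -> List[float]:
        """Gradient of log pi(action | state) w.r.t. theta, flattened.

        For this parameterisation, the block belonging to action b is
        (1[b == action] - pi(b | state)) * phi.
        """
        out: List[float] = []
        for b, p in enumerate(self.probabilities(phi)):
            indicator = 1.0 if b == action else 0.0
            out.extend((indicator - p) * f for f in phi)
        return out


class ScoreProjector(Projector[Tuple[Sequence[float], int]]):
    """Treats the (state, action) pair as the projector's single input."""

    def __init__(self, policy: SoftmaxPolicy):
        self.policy = policy

    def project(self, x: Tuple[Sequence[float], int]) -> List[float]:
        state_features, action = x
        return self.policy.score(state_features, action)


class LFA(Generic[I]):
    """Linear function approximator over any projector's features."""

    def __init__(self, projector: Projector[I], n_features: int):
        self.projector = projector
        self.weights = [0.0] * n_features

    def evaluate(self, x: I) -> float:
        features = self.projector.project(x)
        return sum(w * f for w, f in zip(self.weights, features))


# Usage: two actions, two state features, all-zero policy weights.
policy = SoftmaxPolicy([[0.0, 0.0], [0.0, 0.0]])
critic = LFA(ScoreProjector(policy), n_features=4)
critic.weights = [1.0, 0.0, 0.0, 0.0]
print(critic.evaluate(([1.0, 2.0], 0)))
```

The appeal of this shape is that the LFA itself never needs to know whether its input is a state or a state-action pair; the cost is that the pairing convention lives in the projector's input type. Whether that trade-off suits the existing `project` signature is an open question for this issue.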