Refinement Network
lalalune opened this issue · comments
Instead of trying to predict all of the tokens at once, we should predict some, keep the ones with high confident, and try again with a modify attention mask.
lalalune opened this issue · comments
Instead of trying to predict all of the tokens at once, we should predict some, keep the ones with high confident, and try again with a modify attention mask.