Extracting attention value

Question

NamHyelin opened this issue a year ago · comments

Hi,
I want to extract decoder's cross attention value,
but there is no option or way to return attention layers.
Can you tell how should I do?
Thanks