PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
NamHyelin opened this issue a year ago
Hi, I want to extract the decoder's cross-attention values, but there is no option or other way to return the attention layers. Can you tell me how I should do this? Thanks.