zengyan-97 / X-VLM

X-VLM: Multi-Grained Vision Language Pre-Training (ICML 2022)

Code for Grad-CAM visualization

qiaomu-miao opened this issue

Hi,

Thanks for the great work.

In Figure 3 of your paper, you showed the Grad-CAM visualizations of your model on RefCOCO+ from text descriptions. Could you share the code for using Grad-CAM on your model?

Thanks!

Have you figured out how to generate the heatmaps?

I tried using torchcam, but I'm only getting a single point and not a 2D grid for the activation map.

Hi,
I have not used Grad-CAM for a long time.
For Grad-CAM, you can refer to the visualization notebook in ALBEF: https://github.com/salesforce/ALBEF/blob/main/visualization.ipynb
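
On getting a single point instead of a 2D grid: one possible cause is that a transformer produces a flat sequence of image tokens rather than a spatial feature map, so the per-token scores have to be reshaped back onto the patch grid by hand. Below is a minimal sketch of that step, not the code used for the paper. It assumes the cross-attention map and its gradients have already been captured (for example with hooks) as arrays of shape `(heads, text_tokens, image_tokens)`. The function name `token_gradcam`, the 24x24 grid, the 12 heads, and the leading `[CLS]` image token are all illustrative assumptions, and random arrays stand in for real model outputs.

```python
import numpy as np


def token_gradcam(attn, grads, grid_hw, has_cls=True):
    """Turn per-token cross-attention and its gradients into 2D heatmaps.

    attn, grads: arrays of shape (heads, text_tokens, image_tokens)
    grid_hw:     (rows, cols) of the image patch grid (assumed known)
    Returns an array of shape (text_tokens, rows, cols).
    """
    rows, cols = grid_hw
    if has_cls:
        # Drop the leading [CLS] image token; it has no spatial position.
        attn, grads = attn[..., 1:], grads[..., 1:]
    if attn.shape[-1] != rows * cols:
        raise ValueError("image token count does not match the patch grid")
    # Grad-CAM style weighting: keep only positive gradients.
    cam = attn * np.clip(grads, 0, None)
    # Average over heads, then unflatten the token axis into the patch grid.
    cam = cam.mean(axis=0)
    return cam.reshape(-1, rows, cols)


# Stand-in data with hypothetical sizes; replace with real captured tensors.
rng = np.random.default_rng(0)
heads, text_tokens, rows, cols = 12, 5, 24, 24
attn = rng.random((heads, text_tokens, 1 + rows * cols))
grads = rng.standard_normal(attn.shape)

heatmaps = token_gradcam(attn, grads, (rows, cols))
print(heatmaps.shape)  # (5, 24, 24)
```

Each `heatmaps[i]` is a low-resolution map for one text token. To overlay it on the input it would still need to be upsampled to the image size. The correct grid size and whether a `[CLS]` token is present depend on the vision encoder, so check those against the model config.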