Questions about the performance analysis of `FrozenBatchNorm2d`

Question

DrRyanHuang opened this issue 5 months ago

Hi, Happy Spring Festival, thx for your great work!

I profiled inference with the PyTorch code, and the reshape operation in `FrozenBatchNorm2d` (src/nn/backbone/common.py) appears to be a bottleneck.

Is there any way to solve this problem?

lyuwenyu · Answer 1 · Sat Feb 17 2024 09:12:19 GMT+0800 (China Standard Time)

A simple solution is to replace `FrozenBatchNorm2d` with `BatchNorm2d` before deployment.
You can do this by adding a `convert_to_deploy` member function to the backbone.

def convert_to_deploy(self):
    # Replace every `FrozenBatchNorm2d` with `BatchNorm2d` here.
    ...
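
One way the replacement could look is sketched below. This is not the repository's implementation: the `FrozenBatchNorm2d` class here is a stand-in that assumes the usual layout (buffers `weight`, `bias`, `running_mean`, `running_var` plus an `eps` attribute), and `frozen_to_batchnorm` is a hypothetical helper name. Check both against src/nn/backbone/common.py before relying on it.

```python
import torch
import torch.nn as nn


class FrozenBatchNorm2d(nn.Module):
    """Stand-in for the real class; the buffer names and eps are assumptions."""

    def __init__(self, num_features, eps=1e-5):
        super().__init__()
        self.register_buffer("weight", torch.ones(num_features))
        self.register_buffer("bias", torch.zeros(num_features))
        self.register_buffer("running_mean", torch.zeros(num_features))
        self.register_buffer("running_var", torch.ones(num_features))
        self.eps = eps

    def forward(self, x):
        # The per-call reshapes below are the operations the question refers to.
        w = self.weight.reshape(1, -1, 1, 1)
        b = self.bias.reshape(1, -1, 1, 1)
        rv = self.running_var.reshape(1, -1, 1, 1)
        rm = self.running_mean.reshape(1, -1, 1, 1)
        scale = w * (rv + self.eps).rsqrt()
        return x * scale + (b - rm * scale)


def frozen_to_batchnorm(module):
    """Hypothetical helper: recursively swap FrozenBatchNorm2d for BatchNorm2d."""
    if isinstance(module, FrozenBatchNorm2d):
        bn = nn.BatchNorm2d(module.weight.numel(), eps=module.eps)
        bn = bn.to(module.weight.device)
        with torch.no_grad():
            bn.weight.copy_(module.weight)
            bn.bias.copy_(module.bias)
            bn.running_mean.copy_(module.running_mean)
            bn.running_var.copy_(module.running_var)
        bn.requires_grad_(False)
        # In eval mode BatchNorm2d uses the running statistics, which matches
        # the frozen layer's arithmetic.
        return bn.eval()
    for name, child in list(module.named_children()):
        setattr(module, name, frozen_to_batchnorm(child))
    return module
```

A backbone's `convert_to_deploy` could then simply call `frozen_to_batchnorm(self)`. The converted layers only match the frozen ones while the model stays in eval mode; calling `.train()` afterwards would make `BatchNorm2d` use batch statistics again.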

See this call stack

If you get any results, please share them here.