Third dimension issue with our images when using Cityscapes weights

Question

Shiv1143 opened this issue 2 years ago

While using the Cityscapes weights, I ran into errors concerning the third dimension of our images in relation to the encoder features in model.py. I worked around this by slicing the third dimension to the appropriate size so that it matches our image dimensions. Can you fix this issue on your end?

Daniel Seichter · Answer 1 · Mon Feb 20 2023 15:53:39 GMT+0800 (China Standard Time)

Thanks for reporting the issue. Can you add a link that points to the exact line?

Shivam Kumar · Answer 2 · Sun Dec 31 2023 08:41:56 GMT+0800 (China Standard Time)

Sorry, I wasn't aware that someone had replied. Please help me create a pull request for this issue.

Daniel Seichter · Answer 3 · Sun Dec 31 2023 14:55:46 GMT+0800 (China Standard Time)

I still do not understand the problem exactly. Can you give us more details and point to the line in the code?

Shivam Kumar · Answer 4 · Sun Dec 31 2023 21:59:08 GMT+0800 (China Standard Time)

In the model.py file, in the forward function:
def forward(self, decoder_features, encoder_features):
    out = self.conv3x3(decoder_features)
    out = self.decoder_blocks(out)

    if self.training:
        out_side = self.side_output(out)
    else:
        out_side = None

    out = self.upsample(out)

    if self.encoder_decoder_fusion == 'add':
        out += encoder_features

    return out, out_side

Here the shape of out does not match encoder_features when using the Cityscapes weights, so I made the following change, which I think will work with all weights:
def forward(self, decoder_features, encoder_features):
    out = self.conv3x3(decoder_features)
    out = self.decoder_blocks(out)

    if self.training:
        out_side = self.side_output(out)
    else:
        out_side = None

    out = self.upsample(out)
    # Changed: crop dimension 3 of out to the size of encoder_features
    if out.shape != encoder_features.shape:
        out = out[:, :, :, :encoder_features.shape[3]]
    if self.encoder_decoder_fusion == 'add':
        out += encoder_features
    return out, out_side

You can verify this on your side as well.
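
A note on the workaround above: the crop only shortens dimension 3 of out, so a mismatch in any other dimension, or an out that is shorter than encoder_features, would typically still make the addition fail. As a debugging aid, a minimal sketch like the following could report which axes differ before the fusion step. The helper name describe_shape_mismatch and the example shapes are hypothetical and not part of the repository; the axis names assume an NCHW layout.

```python
def describe_shape_mismatch(decoder_shape, encoder_shape):
    """Return a readable report of the axes that differ, or None if equal.

    Hypothetical helper for illustration; pass e.g. tuple(out.shape) and
    tuple(encoder_features.shape).
    """
    decoder_shape = tuple(decoder_shape)
    encoder_shape = tuple(encoder_shape)
    if decoder_shape == encoder_shape:
        return None
    if len(decoder_shape) != len(encoder_shape):
        return f"rank differs: decoder {decoder_shape} vs encoder {encoder_shape}"

    # Assumption: NCHW layout (batch, channels, height, width).
    names = ("batch", "channels", "height", "width")
    diffs = []
    for axis, (d, e) in enumerate(zip(decoder_shape, encoder_shape)):
        if d != e:
            name = names[axis] if axis < len(names) else f"axis {axis}"
            diffs.append(f"{name} (dim {axis}): decoder {d} vs encoder {e}")
    return "; ".join(diffs)


# Made-up shapes: the decoder output is one element too long in dimension 3.
print(describe_shape_mismatch((1, 128, 32, 65), (1, 128, 32, 64)))
```

Including such a report, together with the input resolution, in the issue would make the mismatch easier to reproduce.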

Shivam Kumar · Answer 5 · Sun Dec 31 2023 22:00:21 GMT+0800 (China Standard Time)

And is there a way to raise a pull request for this issue?

Daniel Seichter · Answer 6 · Tue Jan 02 2024 20:28:17 GMT+0800 (China Standard Time)

Hmm, ... actually, this should not happen. Can you share the full python command you execute, including all arguments, as well as some details about your python environment (pip list, conda list)?

Shivam Kumar · Answer 7 · Thu Jan 04 2024 13:19:38 GMT+0800 (China Standard Time)

I generally use a Conda environment. I ran it quite a long time ago, so I can't remember the command, but the issue was that when running with the Cityscapes weights the third dimension isn't aligned with the encoder shape and is longer than expected, so I modified the code to make it work.

Daniel Seichter · Answer 8 · Mon Jan 08 2024 22:00:00 GMT+0800 (China Standard Time)

I guess you just picked the wrong context module and/or input resolution; both are different for Cityscapes compared to NYUv2/SUNRGB-D.
I will close this issue as we are not able to reproduce it (anymore).
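
To illustrate how an input resolution alone can produce a feature map that is slightly too long, here is a minimal sketch of the size arithmetic. It assumes a generic encoder that halves the resolution with stride-2 convolutions (kernel 3, padding 1) and a decoder that upsamples by a factor of 2; these settings and the function names are assumptions for illustration, not taken from the repository.

```python
def conv_out_size(size, kernel=3, stride=2, padding=1):
    """Standard output-size formula for a convolution with dilation 1."""
    return (size + 2 * padding - kernel) // stride + 1


def skip_mismatches(size, levels):
    """List the (level, encoder_size, upsampled_size) triples that disagree.

    Assumption: each level downsamples once, and the decoder doubles the
    size of the next-deeper level before adding the encoder features.
    """
    sizes = [size]
    for _ in range(levels):
        sizes.append(conv_out_size(sizes[-1]))

    mismatches = []
    for level in range(levels):
        encoder_size = sizes[level]
        upsampled_size = 2 * sizes[level + 1]
        if encoder_size != upsampled_size:
            mismatches.append((level, encoder_size, upsampled_size))
    return mismatches


print(skip_mismatches(1024, 5))  # every level is even, so nothing differs
print(skip_mismatches(1000, 5))  # odd sizes appear at the deeper levels
```

Under these assumptions, a width divisible by 2 at every level round-trips cleanly, whereas an odd intermediate size comes back one element longer after upsampling, which would match the "longer than expected" symptom described in this thread.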