image warp function

Question

image warp function

lxtGH opened this issue 6 years ago · comments

Hi ! Thanks for your code.
I want to warp image(feature map) to the next use optical flow, How can I do that use your code?

Kai Chen · Answer 1 · Sat Mar 03 2018 13:54:37 GMT+0800 (China Standard Time)

cvbase is a library independent to deep learning frameworks and provides some commonly used utils. To implement the warp function, you may reply on a certain framework, here is a pytorch example.

# update at 22/08/2018 with pytorch>=0.4.0
def flow_warp(x, flow, padding_mode='zeros'):
    """Warp an image or feature map with optical flow
    Args:
        x (Tensor): size (n, c, h, w)
        flow (Tensor): size (n, 2, h, w), values range from -1 to 1 (relevant to image width or height)
        padding_mode (str): 'zeros' or 'border'

    Returns:
        Tensor: warped image or feature map
    """
    assert x.size()[-2:] == flow.size()[-2:]
    n, _, h, w = x.size()
    x_ = torch.arange(w).view(1, -1).expand(h, -1)
    y_ = torch.arange(h).view(-1, 1).expand(-1, w)
    grid = torch.stack([x_, y_], dim=0).float().cuda()
    grid = grid.unsqueeze(0).expand(n, -1, -1, -1)
    grid[:, 0, :, :] = 2 * grid[:, 0, :, :] / (w - 1) - 1
    grid[:, 1, :, :] = 2 * grid[:, 1, :, :] / (h - 1) - 1
    grid += 2 * flow
    grid = grid.permute(0, 2, 3, 1)
    return F.grid_sample(x, grid, padding_mode=padding_mode)

# pytorch 0.3
def flow_warp(x, flow, padding_mode='zeros'):
    """Warp an image or feature map with optical flow
    Args:
        x (Variable): size (n, c, h, w)
        flow (Variable): size (n, 2, h, w), values range from -1 to 1 (relevant to image width or height)
        padding_mode (str): 'zeros' or 'border'

    Returns:
        Variable: warped image or feature map
    """
    assert x.size()[-2:] == flow.size()[-2:]
    n, _, h, w = x.size()
    x_ = torch.arange(w).view(1, -1).expand(h, -1)
    y_ = torch.arange(h).view(-1, 1).expand(-1, w)
    grid = torch.stack([x_, y_], dim=0).float().cuda()
    grid = grid.unsqueeze(0).expand(n, -1, -1, -1)
    grid[:, 0, :, :] = 2 * grid[:, 0, :, :] / (w - 1) - 1
    grid[:, 1, :, :] = 2 * grid[:, 1, :, :] / (h - 1) - 1
    grid = Variable(grid)
    grid += 2 * flow
    grid = grid.permute(0, 2, 3, 1)
    return F.grid_sample(x, grid, padding_mode=padding_mode)

Xiangtai Li · Answer 2 · Sat Mar 03 2018 15:24:36 GMT+0800 (China Standard Time)

thanks for your reply, but I found the dim of x and grid that doesn't match for the input of function F.grad_sample. Is x the input feature or image?

Kai Chen · Answer 3 · Sat Mar 03 2018 15:38:33 GMT+0800 (China Standard Time)

x can be either the image or input feature, as long as the shape of x is (n, c, h, w).

Kai Chen · Answer 4 · Thu Mar 08 2018 09:26:45 GMT+0800 (China Standard Time)

Hi @lxtGH , I found some typo in my examples, x, y should be x_ and y_ to avoid name conflict.

PK15946 · Answer 5 · Fri Jun 08 2018 20:36:11 GMT+0800 (China Standard Time)

thanks a lot for your code @hellock , I think edit "values range from 0 to 1" to"values range from -1 to 1" will be better, because I add width then dive 2*width to make the flow strictly range from 0 to 1, but it's wrong. Actually all you have to do is just dive width.

EnQing626 · Answer 6 · Wed Aug 22 2018 10:01:24 GMT+0800 (China Standard Time)

Thanks for @hellock code. I think the "values range from 0 to 1" should be edited in "values range from -1 to 1" because of the function's demand https://pytorch.org/docs/0.3.1/nn.html#grid-sample. And there is a small issue that should you bound the value of flow before "grid += 2 * flow"?

Kai Chen · Answer 7 · Wed Aug 22 2018 21:32:18 GMT+0800 (China Standard Time)

@PK15946 @CJEQ Thanks for your comments. The range of flow values should be [-1, 1] and I will update the code snippet.
@CJEQ do you mean clip the flow values in case of NaN?

EnQing626 · Answer 8 · Thu Aug 23 2018 10:24:10 GMT+0800 (China Standard Time)

@hellock Yes. grid += 2 * flow may cause the value out of bound. So I think it would be better to set grid += 2 * flow before grid[:, 0, :, :] = 2 * grid[:, 0, :, :] / (w - 1) - 1, grid[:, 1, :, :] = 2 * grid[:, 1, :, :] / (h - 1) - 1. Is this make sense?

Kai Chen · Answer 9 · Thu Aug 23 2018 10:45:45 GMT+0800 (China Standard Time)

@CJEQ The warped coordinate are not necessary to be restricted between [-1, 1], and it is usual that it may exceeds the image boundary. The argument padding_mode of grid_sample will handle such cases. The values of flow here are computed relative to the height/width of images or feature maps, so grid += 2 * flow cannot be moved before generating a uniform grid.

mikirui · Answer 10 · Thu Jan 24 2019 16:36:22 GMT+0800 (China Standard Time)

@hellock There may be some bugs in the codes:
grid = grid.unsqueeze(0).expand(n, -1, -1, -1)
grid[:, 0, :, :] = 2 * grid[:, 0, :, :] / (w - 1) - 1
grid[:, 1, :, :] = 2 * grid[:, 1, :, :] / (h - 1) - 1
grid += 2 * flow

since expand would not allocate new memory, thus when batch size (i.e. n) is greater than 1, grid will add up n times with 2 * flow for each item in the batch, which is unreasonable. I think use another variable like grid_x = grid + 2 * flow , or modify as flow = flow * 2 + grid and then call grid_sample function with flow, or use repeat instead of expand would be better.

Ziqi Zhang · Answer 11 · Wed Jun 05 2019 10:09:44 GMT+0800 (China Standard Time)

Hi @hellock , thanks for your code sharing. You point out that the input flow should range from -1 to 1 (relevant to image width or height), does that mean I should divide the original flow by width and height? (In my case the flow is bounded within +- 20). And can you tell me why do you multiply flow by 2 (grid += 2 * flow). Thanks

Julia Gong · Answer 12 · Wed Aug 28 2019 08:30:04 GMT+0800 (China Standard Time)

Hi! I was using this function for warping batches of images and noticed that with batch size greater than 1, there seems to be bleeding of information across the warped images (the flows seem to "contaminate" other images in the batch so that each warp does not only affect its corresponding one of the n images). I was wondering if someone could looking into this issue or point to what could be causing this? Thanks!

mikirui · Answer 13 · Wed Aug 28 2019 15:27:45 GMT+0800 (China Standard Time)

@juliagong You may refer to my last comment.

Julia Gong · Answer 14 · Thu Aug 29 2019 08:12:22 GMT+0800 (China Standard Time)

Thanks, @mikirui! I ended up making my way to this conclusion before seeing your earlier comment, which was indeed exactly the problem. Thanks for pointing it out. Hopefully, others with the same issue will find it faster!

MSLAwan · Answer 15 · Thu Oct 31 2019 22:47:12 GMT+0800 (China Standard Time)

@CJEQ "Yes. grid += 2 * flow may cause the value out of bound. So I think it would be better to set grid += 2 * flow before grid[:, 0, :, :] = 2 * grid[:, 0, :, :] / (w - 1) - 1, grid[:, 1, :, :] = 2 * grid[:, 1, :, :] / (h - 1) - 1. Is this make sense?"
yes you are right. grid += 2 * flow should be before grid normalization. otherwise it is not giving correct warpped output.

Qingzhe Gao · Answer 16 · Thu Dec 12 2019 16:50:53 GMT+0800 (China Standard Time)

@MSLAwan Could you tell me why we need use grid+=2*flow instead of grid+=flow? I am very confused