houqb / CoordAttention

Code for our CVPR 2021 paper on coordinate attention


I want to ask: how should I modify the code to train on 3D images?

InvincibleXiao opened this issue · comments

Do you mean point cloud data?

I mean 3D medical images.

I think it is correct.

I am not sure why we concatenate at dim=2. If I concatenate the 3D feature maps at dim=2, will that work?
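For the 2D case, dim=2 works because both pooled tensors end in a trailing singleton dimension, leaving dim 2 as the only spatial axis to stack along. A minimal shape check, with hypothetical sizes (the pooling and permute follow the 2D pattern described in this thread):

```python
import torch
import torch.nn as nn

x = torch.randn(2, 32, 14, 10)            # [batch, channel, height, width]

pool_h = nn.AdaptiveAvgPool2d((None, 1))  # average over width  -> [n, c, h, 1]
pool_w = nn.AdaptiveAvgPool2d((1, None))  # average over height -> [n, c, 1, w]

x_h = pool_h(x)                           # [2, 32, 14, 1]
x_w = pool_w(x).permute(0, 1, 3, 2)       # [2, 32, 10, 1] after moving w to dim 2

# Both tensors are now [n, c, length, 1], so dim=2 is where h and w can stack.
y = torch.cat([x_h, x_w], dim=2)          # [2, 32, 24, 1]
```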

It needs a try.

It needs a try.

I also want to use it on 3D images. I imitated the 2D code and rewrote it for 3D, but the main problem I ran into is `torch.cat()`.

Assume I have a 3D dataset where the input shape is [2,32,112,160,128], i.e. [batch, channel, height, width, depth].
And I use:
self.pool_h = nn.AdaptiveAvgPool3d((None, None,1))
self.pool_w = nn.AdaptiveAvgPool3d((None,1,None))
self.pool_d = nn.AdaptiveAvgPool3d((1,None,None))
......
x_h = self.pool_h(x)
x_w = self.pool_w(x).permute(0, 1, 2, 4, 3)
x_d = self.pool_d(x).permute(0, 1, 4, 3, 2)

Then I get:
x_h.shape == [2,32,112,160,1]
x_w.shape == [2,32,112,128,1]
x_d.shape == [2,32,128,160,1]

Now I can use torch.cat([x_h, x_w], dim=3) to get a tensor of shape [2,32,112,288,1], but x_d does not match x_h along dim 2 (128 vs. 112), so I don't know how to concatenate [x_h, x_w, x_d]. Can you give me some advice?
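One way around the mismatch is to pool each direction over *both* of the other spatial dimensions, so every descriptor collapses to [n, c, axis, 1, 1] and all three can be concatenated along dim 2, mirroring the h+w stacking of the 2D version. This is a sketch of one possible 3D extension, not the authors' code; the pooling specs and permutes below are assumptions, and the sizes are shrunk from the thread's [2,32,112,160,128] for brevity:

```python
import torch
import torch.nn as nn

x = torch.randn(2, 32, 11, 16, 13)  # stand-in for [batch, channel, H, W, D]

# Pool each descriptor over BOTH remaining spatial dims, so all three end
# in a trailing [1, 1] (assumed 3D extension, not the paper's 2D code).
pool_h = nn.AdaptiveAvgPool3d((None, 1, 1))  # keep H -> [n, c, H, 1, 1]
pool_w = nn.AdaptiveAvgPool3d((1, None, 1))  # keep W -> [n, c, 1, W, 1]
pool_d = nn.AdaptiveAvgPool3d((1, 1, None))  # keep D -> [n, c, 1, 1, D]

x_h = pool_h(x)                              # [2, 32, 11, 1, 1]
x_w = pool_w(x).permute(0, 1, 3, 2, 4)       # [2, 32, 16, 1, 1]
x_d = pool_d(x).permute(0, 1, 4, 3, 2)       # [2, 32, 13, 1, 1]

# All three now agree on every dim except dim 2, so the cat succeeds:
y = torch.cat([x_h, x_w, x_d], dim=2)        # [2, 32, H+W+D, 1, 1]
```

After the shared transform you could recover the three branches with `torch.split(y, [11, 16, 13], dim=2)` and permute each back to its own axis, analogous to the split in the 2D module.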