tensor is not a torch image

Question

tensor is not a torch image

18306125266 opened this issue 3 years ago · comments

Hello，I have a new problem. I want to test this model on my samples . I have got the rgb images and depth images .But i can not run the inference_samples.py normally .There report 'tensor is not a torch image' . Can you help me? Thank you ~

Daniel Seichter · Answer 1 · Wed Jun 16 2021 17:13:06 GMT+0800 (China Standard Time)

The error description is pretty short. Can you please provide some further information, i.e., environment (conda list / pip list), folder structure, executed command, and full error trace).

18306125266 · Answer 2 · Wed Jun 16 2021 17:21:45 GMT+0800 (China Standard Time)

The error description is pretty short. Can you please provide some further information, i.e., environment (conda list / pip list), folder structure, executed command, and full error trace).

I created the rgbd_segmentation environment and prepared sunrgbd dataset.

Then run inference_sample.py

python inference_samples.py --dataset sunrgbd --ckpt_path ./trained_models/sunrgbd/r34_NBt1D.pth --depth_scale 1 --raw_depth
Loaded SUNRGBD dataset without files
Loaded SUNRGBD dataset without files
/data/nas/workspace/jupyter/bisenetv2/ESANet-main/src/build_model.py:29: UserWarning: Argument --channels_decoder is ignored when --decoder_chanels_mode decreasing is set.
warnings.warn('Argument --channels_decoder is ignored when '
/data/nas/workspace/jupyter/bisenetv2/ESANet-main/src/models/resnet.py:101: UserWarning: parameters groups, base_width and norm_layer are ignored in NonBottleneck1D
warnings.warn('parameters groups, base_width and norm_layer are '
/data/nas/workspace/jupyter/bisenetv2/ESANet-main/src/models/model.py:163: UserWarning: for the context module the learned upsampling is not possible as the feature maps are not upscaled by the factor 2. We will use nearest neighbor instead.
warnings.warn('for the context module the learned upsampling is '
Device: cpu
.......
Loaded checkpoint from ./trained_models/sunrgbd/r34_NBt1D.pth
Traceback (most recent call last):
File "inference_samples.py", line 73, in
sample = preprocessor({'image': img_rgb, 'depth': img_depth})
File "/home/admin/.conda/envs/rgbd_segmentation/lib/python3.7/site-packages/torchvision/transforms/transforms.py", line 70, in call
img = t(img)
File "/data/nas/workspace/jupyter/bisenetv2/ESANet-main/src/preprocessing.py", line 195, in call
mean=self._depth_mean, std=self._depth_std)(depth)
File "/home/admin/.conda/envs/rgbd_segmentation/lib/python3.7/site-packages/torchvision/transforms/transforms.py", line 175, in call
return F.normalize(tensor, self.mean, self.std, self.inplace)
File "/home/admin/.conda/envs/rgbd_segmentation/lib/python3.7/site-packages/torchvision/transforms/functional.py", line 209, in normalize
raise TypeError('tensor is not a torch image.')
TypeError: tensor is not a torch image.

Mona Köhler · Answer 3 · Wed Jun 16 2021 22:52:56 GMT+0800 (China Standard Time)

Are you able to run inference_sample.py with the provided samples? Are your images successfully read? What is the datatype and the shape of the images before line 73 when the error is thrown?

18306125266 · Answer 4 · Thu Jun 17 2021 11:45:18 GMT+0800 (China Standard Time)

I can run inference_sample.py with the provided samples.  Is it related to bit depth? I just change the inference_samples.py  line59,60  run：     python inference_samples.py --dataset sunrgbd --ckpt_path ./trained_models/sunrgbd/r34_NBt1D.pth --depth_scale 1 --raw_depth Then ,there report errors Loaded checkpoint from ./trained_models/sunrgbd/r34_NBt1D.pth Traceback (most recent call last):   File "inference_samples.py", line 73, in <module>     sample = preprocessor({'image': img_rgb, 'depth': img_depth})   File "/home/admin/.conda/envs/rgbd_segmentation/lib/python3.7/site-packages/torchvision/transforms/transforms.py", line 70, in __call__     img = t(img)   File "/data/nas/workspace/jupyter/bisenetv2/ESANet-main/src/preprocessing.py", line 198, in __call__     mean=self._depth_mean, std=self._depth_std)(depth)   File "/home/admin/.conda/envs/rgbd_segmentation/lib/python3.7/site-packages/torchvision/transforms/transforms.py", line 175, in __call__     return F.normalize(tensor, self.mean, self.std, self.inplace)   File "/home/admin/.conda/envs/rgbd_segmentation/lib/python3.7/site-packages/torchvision/transforms/functional.py", line 209, in normalize     raise TypeError('tensor is not a torch image.') TypeError: tensor is not a torch image. The experience exchange paste said that the order of functions caused this error. I referred to this method, but it didn't work. How can i test my data?Thank you!

Daniel Seichter · Answer 5 · Thu Jun 17 2021 21:45:16 GMT+0800 (China Standard Time)

If you are able to run inference_sample.py with the samples provided by us, the problem seems to be related to your images. Please check that both images are loaded correctly using a breakpoint at line 70. OpenCV is returning None if loading fails without throwing any error.

Daniel Seichter · Answer 6 · Thu Jun 17 2021 21:53:31 GMT+0800 (China Standard Time)

Beyond that, as already mentioned by Mona, we need the dtypes and shapes for both images at this line for further debugging.

18306125266 · Answer 7 · Fri Jun 18 2021 11:03:47 GMT+0800 (China Standard Time)

The provided image_rgb  shape (424,512,3) dtype uint8      image_depth(424,512) dtype float32 my image_rgb shape(424,512,3)  dtype uint8     image_depth (424,512,3) dtype float32 Is  that the problem?Thank you! These are my images.

Daniel Seichter · Answer 8 · Fri Jun 18 2021 15:54:05 GMT+0800 (China Standard Time)

The problem is related to your depth image - is not a common depth image with depth values encoded in one channel as yours has three channels. It is more like another RGB images with gray values encoding the depth. You should check the depth image.

18306125266 · Answer 9 · Sat Jun 19 2021 14:47:52 GMT+0800 (China Standard Time)

OK，thank you very much!  

18306125266 · Answer 10 · Thu Jun 24 2021 14:46:22 GMT+0800 (China Standard Time)

I get the result .Thanks for your help! Now,there have a new question.How can I output semantic information corresponding to different color regions?

Mona Köhler · Answer 11 · Thu Jun 24 2021 15:34:19 GMT+0800 (China Standard Time)

What do you mean with "different color regions"?

18306125266 · Answer 12 · Fri Jun 25 2021 09:04:57 GMT+0800 (China Standard Time)

 For example ,the orange area refers to the "table",How can i output the information "table"?   

Mona Köhler · Answer 13 · Tue Jun 29 2021 16:07:31 GMT+0800 (China Standard Time)

Before coloring (https://github.com/TUI-NICR/ESANet/blob/main/inference_samples.py#L87), the segmentation contains integers. Each integer refers to one category. For each category there exists a color and a class name as defined here. If you only need the regions for category "table" you can filter the segmentation by the respective integer value.

Shivam Kumar · Answer 14 · Wed Feb 08 2023 07:35:39 GMT+0800 (China Standard Time)

The problem is related to your depth image - is not a common depth image with depth values encoded in one channel as yours has three channels. It is more like another RGB images with gray values encoding the depth. You should check the depth image.

I too faced the same issue as third dimension seems to be not encoded properly...so I did some manipulation and it worked