fungtion / DANN

PyTorch implementation of Domain-Adversarial Training of Neural Networks

RuntimeError: output with shape [1, 28, 28] doesn't match the broadcast shape [3, 28, 28]

houchenyu opened this issue · comments

When I ran your code, I encountered an error when executing data_source = data_source_iter.next():

RuntimeError: output with shape [1, 28, 28] doesn't match the broadcast shape [3, 28, 28].

I have no idea why this error occurs. Could you please give some suggestions?
Besides, I'm using Python 3.6 and PyTorch 1.0. My operating system is Ubuntu 16.04. Thanks.

I have not tested it on PyTorch 1.0 yet, but I have transformed the gray images into 3 channels. Can you post the complete error here?

But there is no error on Windows 10 with the same version of PyTorch.

I have no idea why; I'll test it with PyTorch 1.0 on Ubuntu soon.

Traceback (most recent call last):
  File "/home/hou/桌面/DANN-master/train/main.py", line 98, in <module>
    data_source = data_source_iter.next()
  File "/home/hou/anaconda3/envs/py36/lib/python3.6/site-packages/torch/utils/data/dataloader.py", line 637, in __next__
    return self._process_next_batch(batch)
  File "/home/hou/anaconda3/envs/py36/lib/python3.6/site-packages/torch/utils/data/dataloader.py", line 658, in _process_next_batch
    raise batch.exc_type(batch.exc_msg)
RuntimeError: Traceback (most recent call last):
  File "/home/hou/anaconda3/envs/py36/lib/python3.6/site-packages/torch/utils/data/dataloader.py", line 138, in _worker_loop
    samples = collate_fn([dataset[i] for i in batch_indices])
  File "/home/hou/anaconda3/envs/py36/lib/python3.6/site-packages/torch/utils/data/dataloader.py", line 138, in <listcomp>
    samples = collate_fn([dataset[i] for i in batch_indices])
  File "/home/hou/anaconda3/envs/py36/lib/python3.6/site-packages/torchvision/datasets/mnist.py", line 95, in __getitem__
    img = self.transform(img)
  File "/home/hou/anaconda3/envs/py36/lib/python3.6/site-packages/torchvision/transforms/transforms.py", line 60, in __call__
    img = t(img)
  File "/home/hou/anaconda3/envs/py36/lib/python3.6/site-packages/torchvision/transforms/transforms.py", line 163, in __call__
    return F.normalize(tensor, self.mean, self.std, self.inplace)
  File "/home/hou/anaconda3/envs/py36/lib/python3.6/site-packages/torchvision/transforms/functional.py", line 208, in normalize
    tensor.sub_(mean[:, None, None]).div_(std[:, None, None])
RuntimeError: output with shape [1, 28, 28] doesn't match the broadcast shape [3, 28, 28]

This is the complete error info. It seems something is wrong with the dataloader.

I guess something is wrong here: some images were not converted to 3-channel RGB.

I built this code on PyTorch 0.4.0, Python 2.7, Ubuntu 16.04; could that make a difference? I will test it soon.

Maybe not. According to the error info, the error occurs in the source data loader and is raised by an in-place function. Therefore, I don't think this error is caused by the conversion.

This error was caused by the shape mismatch between the tensor and self.mean in F.normalize: the tensor was [1, 28, 28] while self.mean was [0.5, 0.5, 0.5]. The shape of self.mean implies that the tensor should be [3, *, *] instead of [1, *, *], so I think something is wrong with the input tensor.

The MNIST dataset consists of grayscale images with 1 channel, so the input tensor is [1, *, *]. You did implement the custom loader GetLoader, which converts the input images to RGB, but that loader is used to load the MNIST-M dataset rather than MNIST. Therefore it has no effect on the MNIST data, and the MNIST input tensor is still [1, *, *].
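The mismatch can be illustrated with a small NumPy analogue of what F.normalize does in-place (NumPy's in-place broadcasting rule is the same in spirit as the PyTorch one; the array values here are arbitrary):

```python
import numpy as np

img = np.zeros((1, 28, 28))        # grayscale image, 1 channel, like MNIST
mean = np.array([0.5, 0.5, 0.5])   # 3-channel mean

try:
    # in-place subtraction: broadcasting [1,28,28] with [3,1,1] would need
    # to grow the output to [3,28,28], which an in-place op cannot do
    img -= mean[:, None, None]
except ValueError as e:
    print("failed:", e)

img3 = np.repeat(img, 3, axis=0)   # copy the single channel into 3 channels
img3 -= mean[:, None, None]        # now shapes broadcast fine
print(img3.shape)                  # (3, 28, 28)
```

This is why repeating the channel (or using a 1-value mean) makes the error disappear.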

Ooooooh, finally I ran the code successfully. I added a new transform like this:

img_transform1 = transforms.Compose([
    transforms.Resize(image_size),
    transforms.ToTensor(),
    transforms.Lambda(lambda x: x.repeat(3,1,1)),
    transforms.Normalize(mean=(0.5, 0.5, 0.5), std=(0.5, 0.5, 0.5))
])

And I replaced the original transform of the source MNIST data with img_transform1. By doing so, the MNIST images are converted to [3, *, *] tensors. I am curious why the code works under your settings.
Is there a difference in the torchvision.datasets.MNIST() function between torch 0.4 and torch 1.0?

Yes, you're right, the input tensor of MNIST is still [1, *, *]. The problem comes from pytorch/vision@2115380#diff-fc1f220b470714d05cf3ea6acf9fed59L204: the old code used zip to build an iterable, and when multiple iterables are passed, the iterator stops when the shortest one is exhausted. I think that is why it ran well in my settings. After the zip was replaced by broadcasting, my code failed because of the shape mismatch. I will fix it soon, thank you.
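To see why the old zip-based loop masked the problem, here is a minimal sketch (the helper name and pixel values are made up; the old torchvision code iterated channels, mean, and std together in the same way):

```python
def normalize_zip(channels, mean, std):
    # Old behaviour: zip stops at the shortest input, so with a 1-channel
    # image the extra mean/std values are silently ignored, never raising.
    return [[(p - m) / s for p in ch] for ch, m, s in zip(channels, mean, std)]

gray = [[0.0, 0.5, 1.0]]                                   # one channel of pixels
out = normalize_zip(gray, (0.5, 0.5, 0.5), (0.5, 0.5, 0.5))
print(len(out))   # 1 -> no error; only the first mean/std pair was used
print(out[0])     # [-1.0, 0.0, 1.0]
```

The broadcasting version instead requires the channel counts to match, which is what turned the silent mismatch into a RuntimeError.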

Add transforms.CenterCrop() after transforms.Resize:

# load data
transform = transforms.Compose([
    transforms.Resize(224),
    transforms.CenterCrop(224),
    transforms.ToTensor(),
    transforms.Normalize(mean=(0.5, 0.5, 0.5), std=(0.5, 0.5, 0.5)),
])

Let me clarify: if the image has three channels, you need three values for mean. For example, if the image is RGB and mean is [0.5, 0.5, 0.5] (with std [0.5, 0.5, 0.5]), each channel is normalized as (value - 0.5) / 0.5. If the image is grayscale with only one channel, mean and std should each have a single value, e.g. [0.5].
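The per-channel arithmetic can be spelled out with plain numbers (the pixel values here are hypothetical; results are rounded to avoid floating-point noise):

```python
# Normalize transforms each channel c as (value - mean[c]) / std[c].
rgb_pixel = (0.8, 0.4, 0.2)
mean = (0.5, 0.5, 0.5)
std = (0.5, 0.5, 0.5)

norm_rgb = tuple(round((v - m) / s, 2) for v, m, s in zip(rgb_pixel, mean, std))
print(norm_rgb)    # (0.6, -0.2, -0.6)

# A grayscale pixel has one channel, so mean/std need exactly one value each.
gray_pixel = (0.8,)
norm_gray = tuple(round((v - m) / s, 2) for v, m, s in zip(gray_pixel, (0.5,), (0.5,)))
print(norm_gray)   # (0.6,)
```

So a 1-channel image with a 3-value mean is simply an inconsistent configuration.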

transforms.Lambda(lambda x: x.repeat(3,1,1)),
transforms.Normalize(mean=(0.5, 0.5, 0.5), std=(0.5, 0.5, 0.5))

I received the error: Can't pickle local object 'get_data_loaders..'.
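That "Can't pickle local object" error usually appears when a DataLoader with num_workers > 0 has to pickle a transform that contains a function or lambda defined inside another function (such as the transforms.Lambda above); moving the function to module level is a common workaround. A minimal sketch with hypothetical names:

```python
import pickle

def to_three_channels(x):
    # Module-level function: picklable, so DataLoader workers can use it.
    return x.repeat(3, 1, 1)

def make_bad_transform():
    def local_repeat(x):          # defined inside a function -> not picklable
        return x.repeat(3, 1, 1)
    return local_repeat

pickle.dumps(to_three_channels)   # works
try:
    pickle.dumps(make_bad_transform())
except (pickle.PicklingError, AttributeError) as e:
    print(e)                      # Can't pickle local object ...
```

So instead of transforms.Lambda(lambda x: x.repeat(3, 1, 1)), pass a named module-level function to transforms.Lambda.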

@ShuuTsubaki try new code under pytorch 1.0

What should image_size be in the above code? My images are 28x28.
When I set image_size to 784, I get the following error.

RuntimeError: size mismatch, m1: [128 x 1843968], m2: [784 x 128] at /opt/conda/conda-bld/pytorch_1556653099582/work/aten/src/TH/generic/THTensorMath.cpp:961

@priteshgohil image_size = 28

Downgrading torch and torchvision to 0.2.0 and 0.2.1 solved this issue for me.

Thanks a lot! It helped me. BTW, mine is PyTorch 1.1.0 + Win10, working with torchvision 0.2.0.

I'm working on Ubuntu 16.04 and using torch version 1.3.1.
The issue is that the MNIST data set consists of grayscale images.

transform = transforms.Compose([
    transforms.ToTensor(),
    transforms.Normalize(mean=(0.5, 0.5, 0.5), std=(0.5, 0.5, 0.5)),
])

can be changed to

transform = transforms.Compose([transforms.ToTensor(), transforms.Normalize((0.5,), (0.5,))])

This solved the problem for me

img_transform = transforms.Compose([
    transforms.ToTensor(),
    transforms.Normalize([0.5], [0.5])
])
dataset = MNIST('./data', transform=img_transform)