lukemelas / EfficientNet-PyTorch

A PyTorch implementation of EfficientNet

CUDA -> CPU issue

kamwoh opened this issue · comments

def drop_connect(inputs, p, training):
    """ Drop connect. """
    if not training: return inputs
    batch_size = inputs.shape[0]
    keep_prob = 1 - p
    random_tensor = keep_prob
    random_tensor += torch.rand([batch_size, 1, 1, 1], dtype=inputs.dtype)  # uniform [0,1)
    binary_tensor = torch.floor(random_tensor)
    output = inputs / keep_prob * binary_tensor # error happens here
    return output

Error encountered: RuntimeError: expected backend CUDA and dtype Float but got backend CPU and dtype Float

When I try to run on GPU, this error occurs and the traceback points to the line marked above. I think we should move binary_tensor to inputs.device:

binary_tensor = torch.floor(random_tensor).to(inputs.device)
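For reference, a minimal sketch of the whole function with a device fix applied. Instead of moving binary_tensor afterwards, this version creates the random tensor directly on inputs.device, which avoids an extra host-to-device copy; the numerical behaviour is otherwise unchanged:

    import torch

    def drop_connect(inputs, p, training):
        """ Drop connect (stochastic depth) on a per-sample basis. """
        if not training:
            return inputs
        batch_size = inputs.shape[0]
        keep_prob = 1 - p
        # Build the random tensor on the same device (and dtype) as `inputs`
        # so the multiply below never mixes CPU and CUDA tensors.
        random_tensor = keep_prob
        random_tensor += torch.rand([batch_size, 1, 1, 1],
                                    dtype=inputs.dtype, device=inputs.device)
        binary_tensor = torch.floor(random_tensor)  # 1 with prob keep_prob, else 0
        output = inputs / keep_prob * binary_tensor
        return output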

Hi kamwoh, are you using EfficientNet to train your own model, or are you using the pretrained models? If you use the pretrained models, the newly released version requires calling model.eval() after loading the model.
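A minimal example of that usage might look like the following (assuming the package's EfficientNet.from_pretrained entry point and the b0 variant):

    from efficientnet_pytorch import EfficientNet

    # Load pretrained weights, then switch to inference mode so that
    # batch-norm statistics are frozen and drop connect is disabled.
    model = EfficientNet.from_pretrained('efficientnet-b0')
    model.eval()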

Hi @dami23, I'm using the pretrained models for fine-tuning, and yes, I did call model.eval().

Yup, this is now fixed in master. See #29
I'll push out another release to pip soon.