intel / intel-npu-acceleration-library

Intel® NPU Acceleration Library

cannot compile `nn.Sequential` into float32

wcshds opened this issue · comments

When I wrap an `nn.Linear` in an `nn.Sequential`, it fails to compile into a float32 model.

```python
import intel_npu_acceleration_library
import torch
from torch import nn

model = nn.Sequential(
    nn.Linear(128, 512)
)
print(model)
# Compiling the wrapped model for the NPU with dtype=torch.float32 fails
model = intel_npu_acceleration_library.compile(model, dtype=torch.float32)

input = torch.randn((4, 128))
model(input)
```

(screenshot: error raised when compiling the model)

Hi, float32 is not a supported dtype for now. However, from a performance point of view, I suggest you use float16 or a quantized datatype.

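For reference, a minimal sketch of the suggested float16 path, assuming the same `compile(model, dtype=...)` call used in the reproduction above (swap in `torch.int8` for a quantized model):

```python
import intel_npu_acceleration_library
import torch
from torch import nn

model = nn.Sequential(
    nn.Linear(128, 512)
)

# Compile for the NPU in float16 instead of float32;
# torch.int8 would select a quantized model instead.
model = intel_npu_acceleration_library.compile(model, dtype=torch.float16)

x = torch.randn((4, 128))
with torch.no_grad():
    out = model(x)  # expected shape: torch.Size([4, 512])
```
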
Thank you. But I noticed that float32 is used in the `train_mnist.py` example. Is this a typo?

The float32 dtype is mostly used for the training API, which is still quite experimental. For pure inference you should go with float16 or a lower dtype to fully utilize NPU acceleration.

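As a rough illustration of the distinction, a sketch of the two modes; the `training=True` keyword is an assumption drawn from the library's experimental training example (`train_mnist.py`) and may differ between versions:

```python
import intel_npu_acceleration_library
import torch
from torch import nn


def make_model() -> nn.Sequential:
    return nn.Sequential(nn.Linear(128, 512))


# Experimental training path: float32 weights, as in the train_mnist.py example.
# NOTE: the training=True keyword is assumed from that example and may differ
# between library versions.
training_model = intel_npu_acceleration_library.compile(
    make_model(), dtype=torch.float32, training=True
)

# Inference path: float16 (or a quantized dtype) to fully exploit the NPU.
inference_model = intel_npu_acceleration_library.compile(
    make_model(), dtype=torch.float16
)
```
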
Thanks for your explanation.