Converting models available in README of this repo to Engine File throws error

Question

Converting models available in README of this repo to Engine File throws error

SubhankarHalder opened this issue 3 years ago · comments

Hello,

I downloaded the Resnet 18 and Resnet 34 pth models available in the README.md of this Repo
I went through the list of commands to start the ODTK docker, namely:

git clone https://github.com/nvidia/retinanet-examples
docker build -t odtk:latest retinanet-examples/
docker run --gpus all --rm --ipc=host -it -v /home/vast/retinanet:/workspace/model odtk:latest

Then ran the following command inside the docker in the /workspace/model directory:

odtk export retinanet_rn18fpn.pth engine.plan

However, I got an error:

Loading model from retinanet_rn18fpn.pth...
     model: RetinaNet
  backbone: ResNet18FPN
   classes: 80, anchors: 9
Exporting to ONNX...
Building FP16 core model...
Segmentation fault (core dumped)

When running nvcc -V command:

nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2020 NVIDIA Corporation
Built on Mon_Nov_30_19:08:53_PST_2020
Cuda compilation tools, release 11.2, V11.2.67
Build cuda_11.2.r11.2/compiler.29373293_0

With this command apt list | grep nvinfer

libnvinfer-bin/now 7.2.2-1+cuda11.1 amd64 [installed,local]
libnvinfer-dev/now 7.2.2-1+cuda11.1 amd64 [installed,local]
libnvinfer-plugin-dev/now 7.2.2-1+cuda11.1 amd64 [installed,local]
libnvinfer-plugin7/now 7.2.2-1+cuda11.1 amd64 [installed,local]
libnvinfer7/now 7.2.2-1+cuda11.1 amd64 [installed,local]

It seems linvinfer is working on cuda 11.1 but nvcc is 11.2
Could that be the reason? How do I fix this?

Deleted user · Answer 1 · Wed Aug 11 2021 07:05:02 GMT+0800 (China Standard Time)

Can you change this line to 21.06, and try again?

Subhankar Halder · Answer 2 · Wed Aug 11 2021 13:36:29 GMT+0800 (China Standard Time)

Thanks a lot! Your idea worked!