why '/dev/nvidia-uvm' is a required file check for nvidia GPU?

Question

why '/dev/nvidia-uvm' is a required file check for nvidia GPU?

LiweiPeng opened this issue 3 years ago · comments

What's the issue, what's expected?:
This is v0.6.0. In (superbench code gpu.py) [(https://github.com/microsoft/superbenchmark/blob/main/superbench/common/devices/gpu.py#L24), it checks whether a GPU is nvidia by checking both '/dev/nvidiactl' and '/dev/nvidia-uvm'. The question is: why does it require /dev/nvidia-uvm? I found some GPU type like 'Tesla K80' with cuda 11.4 doesn't have this.

How to reproduce it?:
Logon to a machine with 'Tesla K80' + cuda 11.4.

Yifan Xiong · Answer 1 · Sat Sep 10 2022 19:17:34 GMT+0800 (China Standard Time)

It's required when using gpus in docker (here's an example), and should be included as one step during nvidia driver installation

If you cannot find /dev/nvidia-uvm in your system, you can run following commands to create it accroding to above document:

/sbin/modprobe nvidia-uvm
D=`grep nvidia-uvm /proc/devices | awk '{print $1}'`
mknod -m 666 /dev/nvidia-uvm c $D 0

Liwei Peng · Answer 2 · Tue Sep 20 2022 22:14:43 GMT+0800 (China Standard Time)

Because K80 GPU is not supported by superbench, it is OK not to support it.