coreylowman / cudarc

Safe rust wrapper around CUDA toolkit

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Use default stream instead of a new stream per CudaDevice

coreylowman opened this issue · comments

Instantiating multiple devices creates multiple streams, and there is tricky synchronization problems between them.

is it generally a problem to synchronize multiple devices?

and isn't the default stream non-async (but only for non-explicit async ops?)

By device in description I meant cudarc::CudaDevice. Each of these will have its own stream at the moment