HPCToolkit / hpctoolkit

HPCToolkit performance tools: measurement and analysis components

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Configurable wait time for operation channels

Jokeren opened this issue · comments

There was a channel wait call after the following line.

gpu_operation_channel_set_process(current_operation_channels_count);

When using cupti continuous pc sampling, we use the operation channel to coordinate application threads and the background thread to attribute pc samples. If the operation channel is called very frequently without waiting, it could cause up to 4x slowdown on ufront.

Therefore, I propose to add a control knob for the wait time to reduce overhead in certain cases. By default the wait time can be 0.

Seems like the problem comes from CUPTI pc sampling's bug but not our operation thread.