google / XNNPACK

High-efficiency floating-point neural network inference operators for mobile, server, and Web


Does XNNPACK support concurrent runs?

snnn opened this issue · comments

Like this: https://github.com/tensorflow/tensorflow/blob/v1.10.1/tensorflow/cc/tutorials/example_trainer.cc, which creates a session from a model, then uses multiple threads to invoke Session::Run() concurrently. Session::Run() is thread-safe, so a single session object can be shared by multiple threads.

From XNNPACK's API, this seems hard to achieve. For example, suppose we have a model with only one conv node. Running that conv node takes three steps:

  1. Create an operator with weights: xnn_create_convolution2d_nhwc_f32
  2. Setup the operator with input/output buffers: xnn_setup_convolution2d_nhwc_f32
  3. Run the operator

Prepacking happens in step 1. So if we have multiple threads, we will need multiple operators, and therefore multiple copies of the weights. Is that the case?

Correct. Concurrent inferences on the same operator are not supported.