[Performance] QNN EP Leaving QDQ Nodes in the QNN Graph
kory opened this issue · comments
Kory Watson commented
Describe the issue
The QNN EP appears to be leaving QDQ nodes in the QNN graph for Mul and Sub, rather than inserting quantized ops. I have included a sample model below.
To reproduce
See attached model.
Affects Inception_v3 among other models.
Urgency
No response
Platform
Linux
OS Version
Ubuntu 22.04
ONNX Runtime Installation
Built from Source
ONNX Runtime Version or Commit ID
Latest
ONNX Runtime API
Python
Architecture
ARM64
Execution Provider
QNN EP
Execution Provider Library Version
QNN 2.20
Model File
Is this a quantized model?
Yes