Inference speed is too slow, how to optimize it and accelerate inference?
SkylerZheng opened this issue
Hi, I ran HPSv2 on a single A100 GPU, and it takes about 21-23 seconds per run. Is there any way to improve the latency?
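Not a maintainer, but a latency of ~20 s per call on an A100 often means the checkpoint is being reloaded from disk on every scoring call rather than once at startup. Below is a minimal, framework-agnostic sketch of the load-once pattern; `load_model` and `score` are hypothetical placeholders standing in for the real HPSv2 loading and scoring code, and the `time.sleep` simulates the slow weight load:

```python
import time
from functools import lru_cache

# Hypothetical stand-in for an expensive checkpoint load
# (e.g. reading CLIP-style weights from disk onto the GPU).
@lru_cache(maxsize=1)
def load_model():
    time.sleep(0.2)  # simulate slow weight loading
    return object()  # placeholder for the real model object

def score(image_path, prompt):
    model = load_model()  # cached: only the first call pays the load cost
    # the real model forward pass would run here, ideally under
    # an inference/no-grad context and in fp16 on the GPU
    return 0.5  # placeholder score

t0 = time.time(); score("img.png", "a photo"); first = time.time() - t0
t1 = time.time(); score("img.png", "a photo"); second = time.time() - t1
assert second < first  # repeat calls skip the reload entirely
```

Beyond caching the model, the usual PyTorch-side levers are disabling gradient tracking during the forward pass, running in half precision, and batching multiple images per call instead of scoring one at a time.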
Do you have a minimal script to reproduce the issue?