ONNX models?

fquirin opened this issue a year ago

I've tried efficientspeech on my Raspberry Pi 4 and it works pretty well (~2s for 3s audio) 👍, but it still needs to be a bit faster to be really useful.
In your code I've seen a comment about the ONNX models being ~3 times faster.
I failed to use the convert script so I was wondering if you could upload the model for testing? 🙂

Rowel Atienza · Answer 1 · Wed May 17 2023 21:45:01 GMT+0800 (China Standard Time)

ONNX conversion now supported.
There are certain limitations, though: for example, dynamic axes are not fully compatible with certain ops in the feature upsampling.

Florian Quirin · Answer 2 · Fri May 19 2023 17:44:33 GMT+0800 (China Standard Time)

Thanks for the update. I'm getting a few warnings during code execution, but it seems to work, and it looks like the ONNX version is about twice as fast 😎 👍. Will do some more tests as soon as I find time.

Btw, your RTF (real-time factor) definition is flipped: faster than real time usually means RTF < 1 😉
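
To make the convention concrete, here is a minimal sketch of RTF computed as synthesis time divided by audio duration, so values below 1 mean faster than real time. The function name `real_time_factor` is made up for illustration and is not taken from the efficientspeech code; the numbers are the rough Raspberry Pi 4 figures from the first post (~2s for 3s audio).

```python
def real_time_factor(synthesis_seconds: float, audio_seconds: float) -> float:
    """RTF = time spent synthesizing / duration of the generated audio.

    RTF < 1 means faster than real time (hypothetical helper, not the repo's API).
    """
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return synthesis_seconds / audio_seconds


# Rough figures from this thread: ~2 s of compute for ~3 s of audio.
rtf_pi4 = real_time_factor(2.0, 3.0)
print(f"RTF = {rtf_pi4:.2f}")  # 0.67, i.e. faster than real time

# The flipped definition (audio duration / synthesis time) is a speed-up factor,
# where larger is better; it is the reciprocal of the RTF above.
speedup = 1.0 / rtf_pi4
print(f"speed-up = {speedup:.2f}x")  # 1.50x
```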