roatienza / efficientspeech

PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

ONNX models?

fquirin opened this issue Β· comments

I've tried efficientspeech on my Raspberry Pi 4 and it works pretty well (~2s for 3s audio) πŸ‘, but it still needs to be a bit faster to be really useful.
In your code I've seen a comment about the ONNX models being ~3 times faster.
I failed to use the convert script so I was wondering if you could upload the model for testing? πŸ™‚

ONNX conversion now supported.
There are certain limitations though like dynamic axis is not fully compatible with certain ops in feature upsampling.

Ty for the update. I'm getting a few warnings during code execution, but it seems to work and it looks like the ONNX version is about twice as fast 😎 πŸ‘. Will do some more tests as soon as I find time.

Btw your RTF definition is flipped, faster than RT usually is <1 πŸ˜‰: RTF