What is the input for encoding at inference time?

Question

Edwardmark opened this issue a month ago

What is the input for encoding at inference time? I think raw audio is the only input, and no STFT or mel spectrogram is needed for inference. Is that right?

ZhangXin · Answer 1 · Wed May 22 2024 17:06:57 GMT+0800 (China Standard Time)

Yes, that is right.
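
To illustrate what "raw audio only" can look like in practice, here is a minimal sketch using only the Python standard library. It is not this project's API: `load_raw_waveform`, `toy_encode`, the 16 kHz sample rate, and the hop size of 320 are all hypothetical stand-ins chosen for illustration. The point is only that the caller hands time-domain samples to the encoder and never computes an STFT or mel spectrogram itself.

```python
import math


def load_raw_waveform(num_samples=16000, sample_rate=16000, freq_hz=440.0):
    """Hypothetical stand-in for reading a mono audio file.

    Returns float samples in [-1, 1] (here: one second of a sine tone).
    """
    return [
        math.sin(2 * math.pi * freq_hz * n / sample_rate)
        for n in range(num_samples)
    ]


def toy_encode(waveform, hop=320):
    """Hypothetical stand-in for a learned waveform encoder.

    Consumes time-domain samples directly and emits one value per hop,
    mimicking the downsampling a strided convolutional encoder performs.
    The hop size is an assumption, not a value taken from the project.
    """
    frames = []
    for start in range(0, len(waveform) - hop + 1, hop):
        chunk = waveform[start:start + hop]
        # Mean absolute amplitude as a placeholder "feature".
        frames.append(sum(abs(s) for s in chunk) / hop)
    return frames


wav = load_raw_waveform()        # raw samples, no spectral preprocessing
batch = [[wav]]                  # nested as (batch=1, channels=1, samples)
codes = toy_encode(batch[0][0])  # the encoder sees the waveform itself
print(len(wav), len(codes))      # prints "16000 50"
```

In a real model the placeholder feature would be replaced by learned layers, but the calling pattern stays the same: waveform in, encoded frames out.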

Edwardmark · Answer 2 · Wed May 22 2024 17:17:21 GMT+0800 (China Standard Time)

@ZhangXInFD Thanks for your quick and helpful reply. Your work is really great!