Is there a significant difference in training results for the SeperationBlock and SeperationBlock_stateful?

Question

Is there a significant difference in training results for the SeperationBlock and SeperationBlock_stateful?

c8x1 opened this issue 2 years ago · comments

I saw that you have written two forms of SeperationBlock. The only difference between them is whether to return the hidden state out. What's the point of doing this? And which implementation has better performance? thank you for your response.

lhwcv · Answer 1 · Tue Jul 05 2022 15:58:05 GMT+0800 (China Standard Time)

@c8x1 "hidden state out" is for realtime enhance, which process trunk by trunk, it has better performance when in inference

Noc · Answer 2 · Tue Jul 05 2022 16:02:25 GMT+0800 (China Standard Time)

看到你好像是**的，那就不用蹩脚英语了 :D.

所以stateful的就只是用来推理的咯？

我之前也有试过把hidden传出来的训练方式，目的是把这batch的hidden state作为下一个batch的hidden state初始值，避免训练的片段比较短，实际推理又比较长的RNN泛化问题，跟这个是两回事是吧？

lhwcv · Answer 3 · Tue Jul 05 2022 16:03:37 GMT+0800 (China Standard Time)

对的，应该是两个问题。

Noc · Answer 4 · Tue Jul 05 2022 16:05:17 GMT+0800 (China Standard Time)

感谢哈，你的模型实现简洁易懂！

lhwcv · Answer 5 · Tue Jul 05 2022 16:07:34 GMT+0800 (China Standard Time)

@c8x1 谢谢，我们有问题再交流