Input vector shape
saswat0 opened this issue · comments
How is the input vector shape (800,30)? Shouldn't it be (800,64) owing to the fact that we're extracting 64 dim fbank?
A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.
saswat0 opened this issue · comments
How is the input vector shape (800,30)? Shouldn't it be (800,64) owing to the fact that we're extracting 64 dim fbank?