A potential bug

Question

A potential bug

tomato18463 opened this issue 9 months ago · comments

Hi

I used part of your code in my work, and I find a potential bug (I have not run your original code though). Please can you give it a check? Specifically, this line pads the audio data if its length is smaller than 640 times the corresponding video data length. And this line says the variable data has a size of Tx1, so the torch.nn.functional.pad function in this line will result a output size of Tx(1+padding). This seems incorrect to me. I think the padding result is supposed to be (T+padding)x1, and this line may need to be changed to something like torch.nn.functional.pad(data, (0, 0, 0, padding), "constant"). I know I may be wrong as I have not run your original code. Please can you check it anyway?

Thanks!

Pingchuan Ma · Answer 1 · Mon Dec 11 2023 19:18:24 GMT+0800 (China Standard Time)

Hi @tomato18463, I have fixed the bug. Thank you for pointing out this! #20