There is a writing error in the paper
zchuz opened this issue · comments
Zheng Chu commented
The batch size in the pre-training and refinement phases should be 16 per GPU instead of 64 per GPU.
Shoubin commented
For pre-training and self-refinement, we sample 4 frames from each video sample and calculate gradients on the reshaped batch. With 16 videos per GPU, the batch size on each GPU is 4 × 16 = 64, consistent with the specification in the paper. Thanks for your suggestion; we will clarify this later.
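The arithmetic above can be sketched in a few lines. This is only an illustration of how 16 videos with 4 sampled frames each become a frame-level batch of 64; the names `VIDEOS_PER_GPU`, `FRAMES_PER_VIDEO`, and `flatten_frames` are hypothetical and not taken from the repository, and the real code presumably reshapes tensors rather than Python lists.

```python
# Hypothetical illustration; none of these names come from the repository.
VIDEOS_PER_GPU = 16    # videos loaded on each GPU
FRAMES_PER_VIDEO = 4   # frames sampled from each video


def flatten_frames(video_batch):
    """Merge the video and frame axes: [videos][frames] -> [videos * frames]."""
    return [frame for video in video_batch for frame in video]


# Stand-in for real frames: each frame is labelled (video_index, frame_index).
video_batch = [
    [(v, f) for f in range(FRAMES_PER_VIDEO)]
    for v in range(VIDEOS_PER_GPU)
]

frame_batch = flatten_frames(video_batch)

# Gradients are computed over this flattened, frame-level batch.
effective_batch_size = len(frame_batch)
print(effective_batch_size)
```

So a data-loader setting of 16 videos per GPU and a reported batch size of 64 per GPU describe the same configuration, counted in videos and in frames respectively.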
Zheng Chu commented
Thank you for your reply. I had not taken into account that four frames are sampled from each video.