PKU-YuanGroup / Chat-UniVi

[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

Paper: https://arxiv.org/abs/2311.08046

about temporal merging

xxtars opened this issue · comments

Thank you very much for making your work open source!

I have a question after reading the paper: how do you ensure that the frames $f^m$ within an event are temporally contiguous after clustering the frame-level features? Is there any algorithmic constraint enforcing this? I couldn't find a related description in the paper or the code.

Looking forward to your reply!

Thank you for raising this issue. Our algorithm does not strictly require the frames within an event to be adjacent. However, this flexibility can indeed disrupt the video's temporal order. Do you have any suggestions on how we could address this?
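To illustrate the point being discussed: when frames are clustered purely by feature similarity, a single cluster can contain frames from different parts of the video. A minimal sketch of one possible post-processing step (this is a hypothetical illustration, not the repository's actual code) would split each cluster's assignments into maximal runs of consecutive frame indices, so that every resulting event is temporally contiguous:

```python
def split_into_contiguous_events(labels):
    """Given per-frame cluster labels (labels[i] = cluster id of frame i),
    return a list of (label, frame_indices) events where each event is a
    maximal run of consecutive frames sharing the same label."""
    events = []
    for i, lab in enumerate(labels):
        # Extend the current event only if the label matches and the
        # frame index is consecutive; otherwise start a new event.
        if events and events[-1][0] == lab and events[-1][1][-1] == i - 1:
            events[-1][1].append(i)
        else:
            events.append((lab, [i]))
    return events

# Frames 0-1 and 4-5 share cluster 0 but are separated in time,
# so they become two distinct events.
labels = [0, 0, 1, 1, 0, 0, 2]
print(split_into_contiguous_events(labels))
# → [(0, [0, 1]), (1, [2, 3]), (0, [4, 5]), (2, [6])]
```

The trade-off is that splitting increases the number of events (and hence tokens), which is presumably why a purely similarity-based clustering without an adjacency constraint was chosen.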