fuankarion / active-speakers-context

Code for the Active Speakers in Context Paper (CVPR2020)

Capturing audio segment from clip

okankop opened this issue

Hi, to capture the audio segment from the clip, you subtract audio_offset when computing audio_start and audio_end at L103. Is this the correct way to capture the audio segment? I believe we should not subtract audio_offset.

Hi, we need the audio offset because the timestamp in the ground-truth file localizes the audio event in the full video, but is not suitable for the audio clips required by the STE. The offset contains the beginning timestamp of the audio clip. Subtracting it lets us map the timestamp in the ground-truth file to the same location in the clip.
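
To make the mapping concrete, here is a minimal sketch. It is not the repository's code at L103: the function name `to_clip_sample_range`, the 16 kHz default sample rate, and the example timestamps are all assumptions chosen for illustration.

```python
def to_clip_sample_range(event_start, event_end, audio_offset, sample_rate=16000):
    """Map full-video timestamps (in seconds) to sample indices inside a clip.

    Hypothetical helper: `audio_offset` is the time (in seconds) at which the
    clip begins in the full video; `sample_rate` is an assumed value.
    """
    # Shift both timestamps so that 0.0 corresponds to the start of the clip.
    local_start = event_start - audio_offset
    local_end = event_end - audio_offset

    # A negative start means the event begins before the clip does.
    if local_start < 0 or local_end < local_start:
        raise ValueError("event does not fall inside the clip")

    # Convert clip-relative seconds to sample indices.
    return int(round(local_start * sample_rate)), int(round(local_end * sample_rate))


# The ground truth places the event at 905.5 s to 906.0 s in the full video,
# and the clip starts at 900.0 s, so inside the clip it sits at 5.5 s to 6.0 s.
start, end = to_clip_sample_range(905.5, 906.0, audio_offset=900.0)
print(start, end)  # 88000 96000
```

Without the subtraction, the indices would be computed from 905.5 s and 906.0 s, which would usually lie far beyond the end of a short clip.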