cvdfoundation / kinetics-dataset

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Question Regarding kinetics-400 Dataset: What are test videos?

dkmv0623 opened this issue · comments

Hello, I'm new to the field of Action recognition and have a question regarding the dataset split. Specifically for the kinetics-400 dataset, in the paper "Unmasked Teacher: Towards Training-Efficient Video Foundation Models," they provide the following summary for the number of training and validation data:
unmasked_teacher

In the Video Swin Transformer paper, they also describe the kinetics-400 dataset as follows:
video_swin

Both papers commonly state that kinetics-400 consists of approximately 240k training videos and 20k validation videos. However, the CSV file provided in this GitHub repository contains around 40k test videos that are not mentioned in the papers. Could you please clarify what are these test videos?

Additionally, the link to https://deepmind.com/research/open-source/kinetics is not working correctly. Has the official project page been removed?

I would appreciate insights from those who have continued their research in the field of Action Recognition, and familiar with the kinetics dataset.

Hi, the dataset has a validation set and a test set. Both are for testing, but validation was designed for people to iterate on when publishing (it was public so people could compute performance). The test set was held out, so only the kinetics dataset creators could compute performance of a method -- this was useful for building competitions.

The website at deepmind.com was removed when the website was updated (i should re-create it somewhere when i have time).

Hello @joaoluiscarreira thanks for this clarification.