cvdfoundation / kinetics-dataset

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Kinetics Datasets Downloader

Kinetics is a collection of large-scale, high-quality datasets of URL links of up to 650,000 video clips that cover 400/600/700 human action classes, depending on the dataset version. The videos include human-object interactions such as playing instruments, as well as human-human interactions such as shaking hands and hugging. Each action class has at least 400/600/700 video clips. Each clip is human annotated with a single action class and lasts around 10 seconds.

The Kinetics project publications can be found here: https://deepmind.com/research/open-source/kinetics.

Note that it may not be safe to train / test across different versions of the dataset, for example the k400 validation set is largely part of the k700 training set so results will not be meaningful.

Updates

5th of May: fixed k400/train/part_120.tar.gz, it was a tar file before

10th of December: add two downloader scripts for datasets automatic setup. (k400_downloader.sh and k700_2020_downloader.sh)

13th of September. K400: replaced corrupted mountain climber validation file and made available 1300+ replacement videos for existing corrupted training videos from various classes. K600: added list of links for held out test set.

4th of August 2021 -- replaced corrupted videos in the kinetics-700-2020 test set (these were typically shorter than 9s as well). There are still 5% of the videos in the test set that are shorter than 9s, but genuinely so (e.g. because they are like that in youtube).

Download Videos

CVDF currently hosts the videos in the Kinetics-400 and Kinetics-700-2020 datasets.

Kinetics-400

Kinetics-400 Download:

Clone repo and enter directory
git clone https://github.com/cvdfoundation/kinetics-dataset.git
cd kinetics-dataset
Download tar gzip files

This will create two directories, k400 and k400_targz. Tar gzips will be in k400_targz, you can delete k400_targz directory after extraction.

bash ./k400_downloader.sh
Extract tar gzip files
bash ./k400_extractor.sh

Kinetics-400 Info:

The train/val/test splits are subdivided into many files. The lists of links to video files can be found here:

https://s3.amazonaws.com/kinetics/400/train/k400_train_path.txt

https://s3.amazonaws.com/kinetics/400/val/k400_val_path.txt

https://s3.amazonaws.com/kinetics/400/test/k400_test_path.txt

It is easy to obtain a specific split (e.g. train), by:

bash download.sh k400_train_path.txt

Then, extract *.tar.gz files by:

bash extract.sh k400_train_path.txt

The links/annotations can be found under the annotation subfolders:

https://s3.amazonaws.com/kinetics/400/annotations/train.csv

https://s3.amazonaws.com/kinetics/400/annotations/val.csv

https://s3.amazonaws.com/kinetics/400/annotations/test.csv

A readme file can be found in:

http://s3.amazonaws.com/kinetics/400/readme.md

News: users found ~1400 corrupted videos. A replacement for the vast majority can be found here:

https://s3.amazonaws.com/kinetics/400/replacement_for_corrupted_k400.tgz

Kinetics-600

Kinetics-600 Download:

Clone repo and enter directory
git clone https://github.com/cvdfoundation/kinetics-dataset.git
cd kinetics-dataset
Download tar gzip files

This will create two directories, k600 and k600_targz. Tar gzips will be in k600_targz, you can delete k600_targz directory after extraction.

bash ./k600_downloader.sh
Extract tar gzip files
bash ./k600_extractor.sh

Kinetics-600 Info:

The train/val/test splits are subdivided into many files. The lists of links to video files can be found here:

https://s3.amazonaws.com/kinetics/600/train/k600_train_path.txt

https://s3.amazonaws.com/kinetics/600/val/k600_val_path.txt

https://s3.amazonaws.com/kinetics/600/test/k600_test_path.txt

The links/annotations can be found under the annotation subfolders:

https://s3.amazonaws.com/kinetics/600/annotations/train.txt

https://s3.amazonaws.com/kinetics/600/annotations/train.csv (incomplete)

https://s3.amazonaws.com/kinetics/600/annotations/val.txt

https://s3.amazonaws.com/kinetics/600/annotations/val.csv (incomplete)

https://s3.amazonaws.com/kinetics/600/annotations/test.csv

https://s3.amazonaws.com/kinetics/600/annotations/kinetics600_holdout_test.csv

A readme file can be found in:

http://s3.amazonaws.com/kinetics/600/readme.md

Kinetics-700-2020

Kinetics-700-2020 Download:

Clone repo and enter directory
git clone https://github.com/cvdfoundation/kinetics-dataset.git
cd kinetics-dataset
Download tar gzip files

This will create two directories, k700-2020 and k700-2020_targz. Tar gzips will be in k700-2020_targz, you can delete k700-2020_targz directory after extraction.

bash ./k700_2020_downloader.sh
Extract tar gzip files
bash ./k700_2020_extractor.sh

Kinetics-700-2020 Info:

The train/val/test splits are subdivided into many files. The lists of links to video files can be found here:

https://s3.amazonaws.com/kinetics/700_2020/train/k700_2020_train_path.txt

https://s3.amazonaws.com/kinetics/700_2020/val/k700_2020_val_path.txt

https://s3.amazonaws.com/kinetics/700_2020/test/k700_2020_test_path.txt

The links/annotations can be found under the annotation subfolders:

https://s3.amazonaws.com/kinetics/700_2020/annotations/train.csv

https://s3.amazonaws.com/kinetics/700_2020/annotations/val.csv

https://s3.amazonaws.com/kinetics/700_2020/annotations/test.csv

A readme file can be found in:

http://s3.amazonaws.com/kinetics/700_2020/K700_2020_readme.txt

Downstream annotations

We also host annotations for AVA-Kinetics and Countix, which both use Kinetics-700 videos.

To download annotations for AVA-Kinetics: https://s3.amazonaws.com/kinetics/700_2020/annotations/ava_kinetics_v1_0.tar.gz

To download annotations for countix: https://s3.amazonaws.com/kinetics/700_2020/annotations/countix.tar.gz

About


Languages

Language:Shell 79.2%Language:Python 20.8%