There are 5 repositories under video-dataset topic.
Video Foundation Models & Data for Multimodal Understanding
Papers, code and datasets about deep learning and multi-modal learning for video analysis
Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.
500,000 multimodal short video data and baseline models. 50万条多模态短视频数据集和基线模型(TensorFlow2.0)。
Summary about Video-to-Text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*
SoccerAct10 is a dataset which contains 10 different soccer actions. This dataset was developed using the videos from YouTube.
Awesome papers & datasets specifically focused on long-term videos.
Tools for loading video dataset and transforms on video in pytorch. You can directly load video files without preprocessing.
Surveillance Perspective Human Action Recognition Dataset: 7759 Videos from 14 Action Classes, aggregated from multiple sources, all cropped spatio-temporally and filmed from a surveillance-camera like position.
:seedling: Starter kit for working with the EPIC-KITCHENS-55 dataset for action recognition or anticipation
Official repository for the paper titled "Bitstream-corrupted Video Recovery: A Novel Benchmark Dataset and Method", accepted by NeurIPS 2023 Dataset and Benchmark Track
Official This-Is-My Dataset published in CVPR 2023
Improving Transfer Learning with a Dual Image and Video Transformer for Multi-label Movie Trailer Genre Classification
The repository contains the code for extracting image and mask from a video segmentation dataset by using the OpenCV library in the Python programming language.
Dataset repository of "MetaVD: A Meta Video Dataset for enhancing human action recognition datasets."
Synthetically Generated Surveillance Perspective Human Action Recognition Dataset: 6901 Videos from 10 action classes, made by a 3D Simulation, all cropped spatio-temporally and filmed from a surveillance-camera like position.
🧠️🖥️2️⃣️0️⃣️0️⃣️1️⃣️📽️ The video category for AI2001, containing video datasets
🧠️🖥️2️⃣️0️⃣️0️⃣️1️⃣️📽️▶️ The video:animation category for AI2001, containing animation video datasets
🧠️🖥️2️⃣️0️⃣️0️⃣️1️⃣️📽️👁️ The video:animation:anime category for AI2001, containing Aime video datasets
🧠️🖥️2️⃣️0️⃣️0️⃣️1️⃣️📽️🕹️ The video:gameplay category for AI2001, containing gameplay video datasets
🧠️🖥️2️⃣️0️⃣️0️⃣️1️⃣️📽️🌳️ The video:nature category for AI2001, containing nature video datasets
🧠️🖥️2️⃣️0️⃣️0️⃣️1️⃣️📽️📸️ The video:photography category for AI2001, containing photography video datasets
This annotation tool is build to clean and create video dataset.
UG2: a Video Benchmark for Assessing the Impact of Image Restoration and Enhancement on Automatic Visual Recognition
Trailers12k is a video movie trailer dataset composed of 12,000 titles associated to 10 genres. It distinguishes from other datasets by its collection procedure aiming to provide a high-quality publicly available dataset.
the frame extractor for Video Datasets with GPU Acceleration