AI_TADA-program-segmentation

Introduction

This repository contains the work on program segmentation for the project "AI-TADA: Automated indexing of historical television data" (https://www.uu.nl/en/research/ai-labs/ai-media-lab/projects/ai-tada-automated-indexing-of-historical-television-data), a collaboration between Utrecht University and the Beeld en Geluid institute.

This repo covers the program segmentation step: splitting an input video stream into its constituent programs.

The general flow of the project is:

  1. Generate shot boundary detections (SBDs) from a video stream.
  2. Segment the input video into shot videos, based on the SBD files.
  3. Extract keyframes for each shot video. (optional)
  4. Calculate features of each (key)frame.
  5. Use LDA / the Fisher score to maximize between-class variance and thereby distinguish program boundaries.

The project also includes scripts for data annotation: converting shot boundaries to annotations and vice versa.

Installation

  1. Create an anaconda environment with Python 3.11.5:
conda create --name AI_TADA python=3.11.5
  2. Activate the environment:
conda activate AI_TADA
  3. Clone the AI-TADA repository:
git clone https://github.com/ivarfresh/AI_TADA-program-segmentation.git
  4. Go to the repository directory:
cd {root_directory}/AI_TADA-program-segmentation
  5. Install requirements.txt (this does not include the packages required for TransNetV2):
pip install -r requirements.txt
  6. Install the TransNetV2 model for SBD by following the steps at:
https://github.com/soCzech/TransNetV2/tree/master/inference

After installation, your root folder should look like the root tree shown in the repository.

  7. You will not have the video data yet; ask your supervisors about this. Make sure your data folder has the tree structure shown in the data tree in the repository.

Usage

General project

  1. If you work on Mac or Linux, run "xmf_to_mp4.py" to convert all videos to ".mp4". If you work on Windows, you can use ".xmf" directly. Hedged, illustrative sketches for each step in this list follow after the list.

  2. Run "TransNet_all_videos.py" in order to run the TransNetV2 model on all videos in a folder. See TransnetV2 run instructions, to run the model via terminal.

  3. Run "LDA_pipeline/MoviePy_segmentation.py" (segment the video into shot videos)

  4. Run "LDA_pipeline/keyframe_FFMPEG.py" (generate keyframes)

  5. Run "LDA_pipeline/color_hists.py" (calculate color histograms)

  6. Run "LDA_pipeline/fisher_score.py" (calculate fisher score with sliding window) or "LDA_pipeline/LDA_hist.py" (calculate sklearn LDA with sliding window).

Shots to annotations

  1. If you want to turn shots into annotations, run "shots_to_annotations.py".
  2. If you want to turn program annotations into a program segmentation vector, run "read_elan_file.py".
  3. If you want to turn program annotations into a shot segmentation vector, run "read_elan_file_shots.py" (a sketch of this shot-level mapping follows after this list).
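
A minimal sketch of the shot-level mapping in step 3, assuming shot intervals in frames (from the SBD step), a known frame rate, and program annotations as (start, end, label) tuples in seconds; the real scripts read the annotations from ELAN files instead, and the file paths here are assumptions:

```python
# Sketch: map program-level annotations onto shots via each shot's midpoint.
# Frame rate, file paths and the example annotations are assumptions.
from pathlib import Path

import numpy as np

FPS = 25.0  # assumed frame rate of the broadcast material
scenes = np.loadtxt("data/videos_mp4/example.scenes.txt", dtype=int).reshape(-1, 2)
programs = [(0.0, 1800.0, "news"), (1800.0, 3300.0, "quiz")]  # hypothetical annotations

shot_labels = []
for start_f, end_f in scenes:
    mid_s = ((start_f + end_f) / 2) / FPS  # shot midpoint in seconds
    label = next((name for s, e, name in programs if s <= mid_s < e), "unknown")
    shot_labels.append(label)

out_dir = Path("data/annotations")
out_dir.mkdir(parents=True, exist_ok=True)
np.savetxt(out_dir / "example_shot_labels.txt", shot_labels, fmt="%s")
```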


License

MIT License

