video-to-pptx
A video-to-PPTX-slides converter whose main goal is to convert scientific presentations to PPTX slides.
N.B.: The converter only supports videos hosted on YouTube for now.
The conversion process has roughly nine steps:
- The video is downloaded into memory or to file storage with the Pytube package;
- A sequence of frames is extracted from the video;
- A sequence of captions is extracted from the video;
- Adjacent frames that are too similar are filtered out, as they likely coincide with longer explanations by the author;
- If any frames have been filtered out, some of the captions might have to be joined so that the caption sequence stays aligned with the reduced frame sequence;
- The frames in the reduced sequence are converted to SVG to infer their structure;
- The SVG frames are converted to unpolished PPTX slides with an SVG-to-PPTX converter;
- The unpolished PPTX slides are refined with the Python-PPTX package*;
- The captions are attached as speaker notes (NotesSlide objects) to their corresponding PPTX slide.
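
The frame-filtering and caption-joining steps above can be sketched as follows. This is a minimal illustration, not the converter's actual code. It assumes each frame is a flattened list of grayscale pixel values and that there is exactly one caption per frame. The names `mean_abs_diff` and `filter_and_align`, the mean-absolute-difference metric, and the default `threshold` are all hypothetical choices.

```python
from typing import List, Sequence, Tuple

# Assumption: a frame is a flattened sequence of grayscale pixels (0-255).
Frame = Sequence[int]


def mean_abs_diff(a: Frame, b: Frame) -> float:
    """Average per-pixel absolute difference between two equally sized frames."""
    if len(a) != len(b):
        raise ValueError("frames must have the same number of pixels")
    if not a:
        return 0.0
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def filter_and_align(
    frames: Sequence[Frame],
    captions: Sequence[str],
    threshold: float = 10.0,  # hypothetical cut-off, to be tuned per video
) -> Tuple[List[Frame], List[str]]:
    """Drop near-duplicate frames and join their captions onto the kept frame."""
    if len(frames) != len(captions):
        raise ValueError("expected exactly one caption per frame")

    kept_frames: List[Frame] = []
    kept_captions: List[str] = []
    for frame, caption in zip(frames, captions):
        is_duplicate = (
            bool(kept_frames)
            and mean_abs_diff(kept_frames[-1], frame) < threshold
        )
        if is_duplicate:
            # The frame is dropped; its caption is appended to the caption
            # of the last kept frame so that both sequences stay aligned.
            kept_captions[-1] = f"{kept_captions[-1]} {caption}".strip()
        else:
            kept_frames.append(frame)
            kept_captions.append(caption)
    return kept_frames, kept_captions
```

Each frame is compared with the last kept frame rather than with its immediate predecessor, so a slide that changes very slowly is not dropped in its entirety. Either comparison would fit the description above.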
*An example of creating a PPTX presentation with Python-PPTX.