jasonppy / yt-dl

yt-dl

This project implements a massively parallel downloader built on yt-dlp, a fork of youtube-dl. To avoid throttling and IP blacklisting, videos are downloaded through remote servers; spreading the work across many servers keeps any single machine from downloading too frequently. The tool is currently geared toward downloading AudioSet data.
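The idea of spreading downloads over many servers can be sketched as follows. This is a minimal illustration, not the code in yt-dl.py: the helper names `assign_round_robin` and `build_remote_command` are hypothetical, the round-robin policy is an assumption, and so is the convention that a server's address is `hostname.domain`. The command is only built, not executed.

```python
import shlex
from itertools import cycle


def assign_round_robin(video_ids, hostnames):
    """Spread video IDs evenly over hosts so no single host downloads too often."""
    assignments = {host: [] for host in hostnames}
    for video_id, host in zip(video_ids, cycle(hostnames)):
        assignments[host].append(video_id)
    return assignments


def build_remote_command(user, host, domain, video_id, remote_tmp):
    """Build (but do not run) an ssh command that downloads one video remotely.

    Assumes yt-dlp is installed on the remote host and that the host is
    reachable as <host>.<domain>; both are assumptions of this sketch.
    """
    url = f"https://www.youtube.com/watch?v={video_id}"
    output_template = f"{remote_tmp}/%(id)s.%(ext)s"
    remote = f"yt-dlp -o {shlex.quote(output_template)} {shlex.quote(url)}"
    return ["ssh", f"{user}@{host}.{domain}", remote]
```

With two hosts and five videos, `assign_round_robin` gives three videos to the first host and two to the second; each resulting command list could then be handed to `subprocess.run` by a worker process.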

Dependencies

In addition, you need passwordless SSH access to the remote servers.
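One way to confirm passwordless access before starting a large run is to attempt a non-interactive login to each server. The sketch below is an assumption about how such a check could look, not part of yt-dl.py: the function names are hypothetical, and the `hostname.domain` address form is assumed. It relies on OpenSSH's `BatchMode=yes` option, which makes `ssh` fail instead of prompting for a password.

```python
import subprocess


def ssh_check_command(user, host, domain, timeout_s=5):
    """Build an ssh command that succeeds only if no password prompt is needed."""
    return [
        "ssh",
        "-o", "BatchMode=yes",                 # never prompt; fail instead
        "-o", f"ConnectTimeout={timeout_s}",   # give up on unreachable hosts
        f"{user}@{host}.{domain}",
        "true",                                # trivial remote command
    ]


def has_passwordless_ssh(user, host, domain, timeout_s=5):
    """Return True if a key-based login to the host works without interaction."""
    try:
        result = subprocess.run(
            ssh_check_command(user, host, domain, timeout_s),
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
            timeout=timeout_s + 5,
        )
    except (subprocess.TimeoutExpired, OSError):
        return False
    return result.returncode == 0
```

Hosts for which `has_passwordless_ssh` returns False could be dropped from the host list before any downloads are scheduled.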

Usage

python3 yt-dl.py [OPTIONS]

Options

--data                           Path to the AudioSet data CSV file
--ontology                       Path to the AudioSet ontology JSON file
--num-workers                    Number of downloading processes
--num-postprocessers             Number of postprocessor processes
--user                           Username on remote servers
--domain                         Domain of remote servers
--hostnames                      Hostnames of remote servers
--tmp                            Temporary storage location
--out                            Output directory
--exp_dir                        Logfile directory
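For reference, the options above could be declared with `argparse` roughly as shown below. This is a sketch under assumptions rather than the parser in yt-dl.py: the flag names come from the list above, but the types, the absence of defaults, and the choice to accept `--hostnames` as a space-separated list are all guesses.

```python
import argparse


def build_parser():
    """Declare the documented flags; types and list handling are assumptions."""
    parser = argparse.ArgumentParser(prog="yt-dl.py")
    parser.add_argument("--data", help="AudioSet data CSV location")
    parser.add_argument("--ontology", help="AudioSet ontology JSON location")
    parser.add_argument("--num-workers", type=int,
                        help="Number of downloading processes")
    parser.add_argument("--num-postprocessers", type=int,
                        help="Number of postprocessor processes")
    parser.add_argument("--user", help="Username on remote servers")
    parser.add_argument("--domain", help="Domain of remote servers")
    # Assumed to be a space-separated list, e.g. --hostnames node1 node2
    parser.add_argument("--hostnames", nargs="+",
                        help="Hostnames of remote servers")
    parser.add_argument("--tmp", help="Temporary storage location")
    parser.add_argument("--out", help="Output directory")
    parser.add_argument("--exp_dir", help="Logfile directory")
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(args)
```

Note that `argparse` exposes hyphenated flags with underscores, so `--num-workers` is read as `args.num_workers`, while `--exp_dir` is read as `args.exp_dir`.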
