felix-berlin / domianarchiv-downloader

A Python script to download audio files from domianarchiv.de

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Domian Archiv Downloader

This script is used to download audio files from the website https://domianarchiv.de.

Requirements

  • Python 3
  • BeautifulSoup4
  • requests

You can install these packages using pip:

pip install beautifulsoup4 requests

Usage

You can run the script using the following command:

python domianarchiv-downloader.py --start_page START_PAGE --end_page END_PAGE --download_dir DOWNLOAD_DIR

Where:

START_PAGE is the page number to start downloading from (default is 0). END_PAGE is the page number to stop downloading at (default is 184). DOWNLOAD_DIR is the directory where the downloaded files will be saved (default is "downloads").

How it works

The script uses BeautifulSoup to parse the HTML of the website and find the links to the audio files. It then downloads these files using the requests library. The script handles HTTP 503 errors by retrying failed requests and adding a delay between requests to avoid overloading the server. It also includes a User-Agent string in the request headers, which some servers require.

Disclaimer

Please respect the rights of the content creators and only download the files if you have permission to do so.

About

A Python script to download audio files from domianarchiv.de

License:GNU Affero General Public License v3.0


Languages

Language:Python 100.0%