mbharanya / SRF-Scraper

Scrape all episodes of an SRF show

Repository from GitHub: https://github.com/mbharanya/SRF-Scraper

SRF Scraper

................................................................................
................................................................................
...........,:loddxxxxxddo:...'codddddddddoolc:,........,codddddddddddddoc,......
.........;d0NWMMMMMMMMMMNd'..;0WMMMMMMMMMMMWWX0d:......cKWMMMMMMMMMMMMMWKc......
........c0WMMMWXKKKKKKKKOc...:0MMMMWXKKKKKNWMMMWKc.....cXMMMMNXXXXXXXXXXk:......
.......,kWMMMXd;,,,,,,,,,....:0MMMM0:,,,,,:dXMMMWk,....cKMMMWk:;;;;;;;;;,.......
.......,kWMMMXo,.............:0MMMMO;......;0MMMW0;....cXMMMWx'.................
.......'lXMMMMN0kxol:,'......:0MMMWO;...',:xXMMMWx,....cXMMMW0ollllllll:'.......
........'l0NMMMMMMMWNKOd:'...:0WMMMO;.'lO0NWMMMNk;.....cXMMMMMWWWWWWWWWKc.......
..........,cdk0XNWWMMMMWXd,..:0WMMMO;.;0WWMMMNkc'......cXMMMMWNNNNNNNNN0:.......
..............',:cokXWMMMNo'.:0WMMMO;.,kNWMMMNk;.......cXMMMWOc::::::::;'.......
...................'dNMMMWx'.:0MMMMO;..;cxNMMMWO:......cXMMMWx'.................
........,:cc::::::co0WMMMXl..;0WMMMO;....,xNMMMW0l'....cXMMMWx'.................
.......,xNWWNNNNNNWWMMMWXo,..;0WMMMO;.....'oXWMMWXo'...cXMMMWx'.................
.......;ONWWWWMWWWWWNKOd;'...;ONWWWk,......'oKNWWWXo'..cKWWWXd'.................
.......';cllloooollc:,'.......;cccc;........';cclcc;'..':cccc,..................
................................................................................
................................................................................

usage: scraper.py [-h] [--url URL] [--show-url SHOW_URL] [--destination DESTINATION] [--year YEAR]

optional arguments:
  -h, --help            show this help message and exit
  --url URL             Direct episode to crawl e.g. https://www.srf.ch/play/tv/ding-dong/video/in-der-evangelischen-wg-und-beim-handorgel-sammler?urn=urn:srf:video:0e1bae7e-5a42-40af-84cc-3d97cc6c13be
  --show-url SHOW_URL   Show url to download e.g. "https://www.srf.ch/play/tv/sendung/ding-dong?id=b558ca46-e1ce-442c-932d-571e5a6ad323"
  --destination DESTINATION
                        Destination to save to (default docker /output/)
  --year YEAR           Download only episodes from given year
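The help text above could be produced by an argparse setup along these lines. This is a minimal sketch, not the actual code in scraper.py: the function name `build_parser`, the `/output/` default value, and parsing `--year` as an integer are assumptions made for illustration.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the options listed in the help text."""
    parser = argparse.ArgumentParser(prog="scraper.py")
    parser.add_argument("--url", help="Direct episode to crawl")
    parser.add_argument("--show-url", help="Show url to download")
    # Assumption: the documented docker default is passed as the argparse default.
    parser.add_argument("--destination", default="/output/",
                        help="Destination to save to (default docker /output/)")
    # Assumption: the year is parsed as an integer.
    parser.add_argument("--year", type=int,
                        help="Download only episodes from given year")
    return parser


# Parse an explicit argument list instead of sys.argv so the example is repeatable.
args = build_parser().parse_args([
    "--show-url",
    "https://www.srf.ch/play/tv/sendung/ding-dong?id=b558ca46-e1ce-442c-932d-571e5a6ad323",
    "--year", "2022",
])
print(args.show_url, args.year, args.destination)
```

Note that argparse exposes `--show-url` as the attribute `args.show_url`, and options that are not passed (here `--url`) default to `None`.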

Run

# download all episodes of a show to the current dir
docker run --rm -v "$(pwd)":/output mbharanya/srf-scraper --show-url "https://www.srf.ch/play/tv/sendung/ding-dong?id=b558ca46-e1ce-442c-932d-571e5a6ad323"
# download a specific episode to the current dir
docker run --rm -v "$(pwd)":/output mbharanya/srf-scraper --url "https://www.srf.ch/play/tv/ding-dong/video/in-der-evangelischen-wg-und-beim-handorgel-sammler?urn=urn:srf:video:0e1bae7e-5a42-40af-84cc-3d97cc6c13be"

Downloaded files are named after the show, the episode date and, if available, the episode title (older episodes do not have a title). Example: Ding Dong-2022-03-11-In der evangelischen WG und beim Handorgel-Sammler.mp4
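The naming scheme can be sketched as follows. This is an illustration of the pattern in the example above, not the scraper's actual implementation: the function `episode_filename` and its parameters are hypothetical, and dropping the title segment entirely when no title exists is an assumption.

```python
from datetime import date
from typing import Optional


def episode_filename(show: str, aired: date,
                     title: Optional[str] = None, ext: str = "mp4") -> str:
    """Build '<show>-<YYYY-MM-DD>[-<title>].<ext>' (hypothetical helper)."""
    parts = [show, aired.isoformat()]
    # Assumption: episodes without a title simply omit the title segment.
    if title:
        parts.append(title)
    return "-".join(parts) + "." + ext


with_title = episode_filename(
    "Ding Dong", date(2022, 3, 11),
    "In der evangelischen WG und beim Handorgel-Sammler",
)
without_title = episode_filename("Ding Dong", date(2022, 3, 11))
print(with_title)
print(without_title)
```

A real implementation would probably also need to strip characters that are invalid in file names (such as `/`), which this sketch does not do.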

Disclaimer

I am not affiliated with SRF in any way. Please respect the rate limits and server costs of our public service. This script should only be used for archival purposes.

