janaab11 / scrape_media

This repo scrapes media from the <>pod101.com series of websites. To begin, set your base website and login details in scripts/scraper.py. Then run run.sh from the root directory with the --start and --end flags to download the corresponding lessons (lesson numbering starts at 2). For example, run:

sh run.sh --start=2 --end=101

to download the first 100 scraped lessons. If you are feeling generous with your hard disk space, you can also run it with the --all=1 flag to download all the scraped lessons.
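The flag semantics described above can be sketched in Python. This is only an illustration, not the code in run.sh or scripts/scraper.py: the function name lesson_numbers, its parameters, and the clamping of --end to the last scraped lesson are all assumptions. It treats both bounds as inclusive, which matches --start=2 --end=101 covering 100 lessons.

```python
FIRST_LESSON = 2  # lesson numbering starts at 2, per the usage notes above


def lesson_numbers(total_scraped, start=None, end=None, download_all=False):
    """Return the lesson numbers a run would download (hypothetical helper).

    total_scraped: how many lessons the scraper found (assumed to be known).
    start, end:    inclusive bounds, mirroring the --start and --end flags.
    download_all:  mirrors --all=1 and ignores start and end.
    """
    last = FIRST_LESSON + total_scraped - 1
    if download_all:
        return list(range(FIRST_LESSON, last + 1))
    if start is None or end is None:
        raise ValueError("pass both start and end, or set download_all=True")
    if start < FIRST_LESSON or end < start:
        raise ValueError("expected 2 <= start <= end")
    # Assumption: an end beyond the last scraped lesson is clamped, not an error.
    return list(range(start, min(end, last) + 1))


if __name__ == "__main__":
    picked = lesson_numbers(total_scraped=500, start=2, end=101)
    print(len(picked), picked[0], picked[-1])
```

With 500 scraped lessons, start=2 and end=101 select lessons 2 through 101, which is 100 lessons in total.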

For more details and control, look under the hood of run.sh.
