Dawit-Sh / PDFSCRAPE

PDFSCRAPE is a script written in Python which will allows you to automatically download pdfs from http://ndl.ethernet.et.edu

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

pdfscrape

PDFSCRAPE

PDFSCRAPE is a script written in python which allows you to automatically download(scrape) pdfs from http://ndl.ethernet.et.edu

Features

  • Easy to use
  • Fast download
  • Scrapes pdfs only from selected department
  • Continue your downloads if download is interrupted.

Requirements

  • Beautiful soup

  • tqdm

Installation

clone this repo

gitclone https://github.com/L0rdC0mm4nd3r/pdfscrape

Install requirements

pip3 install -r requirements.txt

Usage

You can start PDFSCRAPE by

python3 pdfscrape.py

OR

chmod a+x pdfscrape.py && ./pdfscrape.py

GIF

Author

License

This project is licensed under the GNU General Public License v3.0 License - see the LICENSE file for details

Acknowledgments

  • Hat tip to anyone whose code was used

About

PDFSCRAPE is a script written in Python which will allows you to automatically download pdfs from http://ndl.ethernet.et.edu

License:GNU General Public License v3.0


Languages

Language:Python 100.0%