A program for downloading online articles and saving them in a SQLite database.
- You need Python 3.x and Beautiful Soup installed:
pip install beautifulsoup4
- Clone the repository
mkdir git
cd git
git clone https://github.com/th0rben/news-scraper.git
- To send e-mails, create a file named login_data.py in the src folder. It should look like this (example for Gmail):
sender = "sender@gmail.com"
recipient = "recipient"
password = "password"
subject = "subject"
server = "smtp.gmail.com"
port = 465
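As a rough illustration of how these settings can be used to send mail over SSL (the function names below are illustrative, not the project's actual API), a minimal sketch with Python's standard smtplib:

```python
import smtplib
import ssl
from email.message import EmailMessage


def build_message(sender: str, recipient: str, subject: str, body: str) -> EmailMessage:
    """Wrap the article text in a plain-text e-mail."""
    msg = EmailMessage()
    msg["From"] = sender
    msg["To"] = recipient
    msg["Subject"] = subject
    msg.set_content(body)
    return msg


def send_article_mail(body: str) -> None:
    """Deliver the message over SMTP with SSL (port 465 for Gmail)."""
    # Imported here so the settings file is only required when actually sending.
    import login_data

    context = ssl.create_default_context()
    with smtplib.SMTP_SSL(login_data.server, login_data.port, context=context) as conn:
        conn.login(login_data.sender, login_data.password)
        conn.send_message(
            build_message(
                login_data.sender, login_data.recipient, login_data.subject, body
            )
        )
```

Note that Gmail requires an app password (or similarly authorized credentials) for SMTP logins.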
Execute the main.py file
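main.py stores the downloaded articles in the SQLite database. A minimal sketch of that step with the standard sqlite3 module (the table name and columns are assumptions, not the project's actual schema):

```python
import sqlite3

# The real program would open a database file on disk; :memory: keeps the
# sketch self-contained.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE IF NOT EXISTS articles (url TEXT PRIMARY KEY, title TEXT, fetched TEXT)"
)
# The URL as primary key plus INSERT OR IGNORE skips articles already saved.
conn.execute(
    "INSERT OR IGNORE INTO articles VALUES (?, ?, datetime('now'))",
    ("https://example.com/story", "Example headline"),
)
conn.commit()
count = conn.execute("SELECT COUNT(*) FROM articles").fetchone()[0]
```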
To run the scraper automatically, execute the following (adds a cron job to the crontab; the example entry below runs every Monday at 18:00):
cd where/you/saved/it/news-scraper
sudo chmod +x setup.sh
./setup.sh
sudo crontab -e
0 18 * * 1 /home/pi/git/news-scraper/cron/cron.sh
If you want to change the frequency or time, edit cronjob.txt.
For more information see: https://en.wikipedia.org/wiki/Cron
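For example, to run the scraper every day at 12:00, the crontab entry would be (path as installed above):

```
0 12 * * * /home/pi/git/news-scraper/cron/cron.sh
```

The five fields are minute, hour, day of month, month, and day of week; `*` means "every".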
There are no tests yet.
If you stumble over any mistakes, it would be great if you mention them.
- Beautiful Soup - Python library for pulling data out of HTML and XML files
- Eclipse - IDE
- PyDev - Python IDE for Eclipse
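As a quick illustration of how Beautiful Soup pulls data out of HTML (the markup and the `headline` class below are made up, not bild.de's actual page structure):

```python
from bs4 import BeautifulSoup

html = """
<html><body>
  <article><h2 class="headline">First story</h2></article>
  <article><h2 class="headline">Second story</h2></article>
</body></html>
"""

soup = BeautifulSoup(html, "html.parser")
# Collect the text of every <h2 class="headline"> element, in document order.
headlines = [h.get_text(strip=True) for h in soup.find_all("h2", class_="headline")]
# → ['First story', 'Second story']
```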
[27.07.2018] - Scrape articles from bild.de
This project is still in beta.
- th0rben - Initial work - th0rben
This project is licensed under the GPL License - see the LICENSE.md file for details
- This project is inspired by the Spiegel Mining project by D. Kriesel