fidian / webpage

Webpage robot that serves as a gateway between email and the web

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Webpage Grabber

This is an email to web gateway service. It uses fetchmail to receive emails, procmail to match emails against rules, then either links or wget to retrieve files.

Setup

This is broken into three phases so you can do a chunk at a time.

Prerequisites

  1. Ensure you have wget, zip, mime-construct, fetchmail, procmail, xvfb and a working SMTP server (I am using msmtp).

  2. Install LWP, LWP::Protocol::https and HTML::TokeParser via CPAN so Perl can run url.pl. On Ubuntu you need libnet-ssleay-perl and libcrypt-ssleay-perl for Net::SSLeay to work.

  3. Test url.pl and ensure it works. You may need to update Text::Wrap.

     echo -n "" | ./url.pl -t https://www.yahoo.com
    
  4. Install wkhtmltopdf so you have at least version 0.12.x in the path.

Set up Fetchmail

  1. Copy fetchmailrc-template to fetchmailrc.
  2. Change permissions on fetchmailrc: chmod 0600 fetchmailrc
  3. Run touch logs/fetchmail.log to create the initial log file.

Set up Outgoing Mail

  1. Install a MTA. I've configured msmtp to relay to another SMTP server. If you use procmail to relay, you may want to investigate adding message_size_limit = 20480000 to your config.

  2. Run the test files in tests/.

  3. Once that works, set up a cron job to run fetchmail.

     @reboot fetchmail -f webpage/fetchmailrc > /dev/null 2>&1
    

Configuration

Create a file called banned and add email addresses in there when you want users to be banned from using the system.

About

Webpage robot that serves as a gateway between email and the web


Languages

Language:Perl 56.4%Language:Shell 33.6%Language:HTML 10.0%