web-archive-group / heritrix-walkthrough

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Heritrix Walkthrough

Introduction

This is a virtual machine and walkthrough for Heritrix. Heritrix documentation can be found here.

The virtual machine that is built uses 2GB of RAM. Your host machine will need to be able to support that.

The walkthrough can be found in the walkthrough subdirectory.

Requirements

Download each of the following dependencies.

  1. VirtualBox
  2. Vagrant
  3. Git

Virtual Machine

You'll need to get your virtual machine running on the command line. For a basic walkthrough of how to use the command line, please consult this lesson at the Programming Historian.

  1. git clone https://github.com/web-archive-group/heritrix-walkthrough.git
  2. cd heritrix-walkhrough
  3. vagrant up

Environment

  • Ubuntu 14.04
  • Java 7
  • Heritrix 3.2.0

Authors

Acknowlegements

This research has been supported by the Social Sciences and Humanities Research Council with Insight Grant 435-2015-0011.

About

License:MIT License


Languages

Language:Shell 100.0%