ianmilligan1 / Aarhus-Netlab

NetLab Meeting, Aarhus

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

NetLab Meeting, 22 September 2015, Aarhus University (Denmark)

Virtual Machine with Gephi running Figure 1: Virtual Machine with Gephi running

I was invited to speak to the Danish NetLab about historians and web archives. The presentation includes:

  • why I think historians need to use web archives;
  • why existing acccess methods (i.e. Wayback Machine or Archive-It search) are insufficient;
  • how a combination of Warcbase and Shine can help you unlock your collections!

Slidedeck has been uploaded as a PDF, available here. Download it here

I was also invited to speak, later in the day, at the Centre for Internet Studies on my work with the GeoCities torrent. The page about my talk is available here.

Some links follow in the sections below.

Warcbase

The Warcbase Wiki is available here
The Gephi: Converting Site Link Structure is available here

And sample data provided in this repo: output from Site Link Structure is part-r-00000; output from pig2gdf.py can be found in political-links.csv.

I will show a brief example with Gephi. If you have trouble using Gephi, see my section on troubleshooting below.

Shine

WebArchives.ca: a search interface for Archive-It's collection of Canadian Political Parties and Interest Groups.

Shine Figure 2: Shine in Action

WebArchives.ca is an implementation of the UK Web Archive's Shine Interface – an amazing front end that I'll be talking about in the lecture.

It received some media attention in Canada, showing an appetite for this sort of material.

Gephi

Gephi used to be quite difficult to get up and running, but the recent release of Gephi 0.9 has changed this dramatically. It should work out of the box.

Questions/Comments

I am always happy to chat. My personal website is at ianmilligan.ca or you can [e-mail me](mailto:i2millig@uwaterloo.ca].

About

NetLab Meeting, Aarhus


Languages

Language:Python 100.0%