BBC-Data-Unit / spanish-flu-anniversary

Spanish flu: 'We didn't know who we'd lose next'

Home Page:https://www.bbc.co.uk/news/uk-england-45097068

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Spanish flu: 'We didn't know who we'd lose next'

In September 2018 the data unit analysed archives from the Wellcome Library to show the impact of the Spanish Flu pandemic a century earlier.

The BBC England data unit analysed medical officers' reports for various English towns that had been digitised by the Wellcome Library in order to find out how many deaths were specifically attributed to influenza in 1918 and 1919.

Where possible we also gave the overall number of deaths recorded but the way in which individual officers recorded their statistics varied from area to area.

Information had to be manually transcribed from the scanned paper reports by BBC Radio Gloucestershire's Faye Hatcher with assistance from Daniel Wainwright from the England Data Unit, to compile figures for different regions and ages.

Scraping the documents was not feasible: figures appeared on different pages in different reports, and we couldn't guarantee the relevant heading would have been properly OCRd. Breakdowns by location were also inconsistently formatted - Berkshire's report had an 'urban districts' table with a specific influenza row; but the Cambridge one had a table introduced with the sentence "The areas in which these 151 deaths occurred are shown in the following table' - the locations are in rows not columns this time and 'influenza' is mentioned nowhere in the table itself.

Data entry was particularly challenging as different reports used different reporting conventions: for example, some officers only published total figures, some broke down deaths into one set of age ranges and others used different age ranges.

Creating a template for data entry which would accommodate this variety was part of the challenge.

Similarly, changes in administrative districts means that mapping the data would have to be done by central points, rather than area shapes, as shape files do not exist for the districts that existed in 1918.

Jen Meierhans told some of the tragic stories captured in letters written decades ago and shared with the BBC by the Wellcome Library.

As well as BBC Online, the story was also covered across various BBC radio outlets and BBC Breakfast.

Get the data

The data was compiled manually from scanned documents from 1918 and 1919 such as this one for Cambridge and this for Berkshire.

Visualisation

  • Map: Deaths in towns and cities during the Spanish flu outbreak of 1918 and 1919
  • Column chart: Ages of people dying of influenza in 1918 and 1919

Quotes and interviews

  • Hannah Mawdsley, PhD researcher, Queen Mary University of London
  • Quotes taken from an archive of letters written by survivors of the Spanish flu pandemic, held by the Imperial War Museum