ethanyyao / pq_parser

Script to parse text file downloads from ProQuest's Global Newsstream database into a CSV file of metadata and full text.

Parse ProQuest Metadata

This notebook includes a Python function that parses newspaper articles downloaded from ProQuest Global Newsstream into a single CSV file containing each article's metadata and, when available, its full text.
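
The general shape of such a parser can be sketched as follows. This is a minimal illustration, not the notebook's actual function: the names `parse_records` and `records_to_csv` are hypothetical, and the sketch assumes that the export separates records with a line of underscores and writes each field as a "Label: value" line, with long values such as the full text continuing on the following lines. Check both assumptions against a real download before relying on it. In particular, a full-text line that happens to look like "Label: value" would be misread as a new field.

```python
import csv
import io
import re

# Assumption: records are separated by a line made of 20 or more underscores.
RECORD_SEPARATOR = re.compile(r"^_{20,}[ \t]*$", re.MULTILINE)
# Assumption: a field starts a line as "Label: value" with a short, capitalized label.
FIELD_LINE = re.compile(r"^([A-Z][A-Za-z /()-]{0,40}): (.*)$")


def parse_records(text):
    """Split export text into a list of {field label: value} dicts (hypothetical helper)."""
    records = []
    for chunk in RECORD_SEPARATOR.split(text):
        record = {}
        current = None  # label of the field currently being filled
        for line in chunk.splitlines():
            match = FIELD_LINE.match(line)
            if match:
                current = match.group(1)
                record[current] = match.group(2).strip()
            elif current and line.strip():
                # Continuation line: append it to the most recent field.
                record[current] += " " + line.strip()
        if record:
            records.append(record)
    return records


def records_to_csv(records):
    """Return CSV text with one row per record and one column per label seen."""
    fieldnames = []
    for record in records:
        for key in record:
            if key not in fieldnames:
                fieldnames.append(key)
    buffer = io.StringIO()
    # restval="" leaves a cell empty when a record lacks a field (e.g. no full text).
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, restval="")
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()
```

To use the sketch, read the downloaded text file into a string, pass it to `parse_records`, and write the string returned by `records_to_csv` to a `.csv` file. Records without full text simply get an empty cell in that column.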

Created by Cody Hennesy and David Naughton (University of Minnesota, Twin Cities, Libraries). Email Cody (chennesy@umn.edu) with any questions.

Languages

Language: Jupyter Notebook 100.0%