CenterForPeaceAndSecurityStudies / ICBEdataset

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

CC BY-NC-SA 4.0

The International Crisis Behavior Events (ICBe)

This is a github repository for the International Crisis Behavior Events (ICBe) dataset. Submit any issues regarding the dataset, paper, or github repository using the issues tab.

AS OF JUNE 2024, THE DATAVERSE REPLICATION FILES ARE THE MOST UP-TO-DATE. PLEASE USE THAT REPOSITORY FIRST.

The Paper:

Introducing the ICBe Dataset: Very High Recall and Precision Event Extraction from Narratives about International Crises (also available on ArXiv).

The v1.1 Online Appendix can be downloaded here.

Version 1.1 is the most recent version and was posted on ArXiv on July 26, 2022. Version 1.0 was posted on February 15, 2022.

The Data:

The agreed datasets are the final dataset used in much of the paper and figures. It includes our best efforts at cleaning the data and reconciling intercoder agreement. The dataset is available in long and wide format. The data is also available in .tsv format in the same folders.

The coding and cleaning process are described in the paper with additional information and details about the variables in the codebook. - ICBEdataset Codebook

The Authors:

Rex W. Douglass, Thomas Leo Scherer, J. Andrés Gannon, Erik Gartzke, Jon Lindsay, Shannon Carcelli, Jonathan Wilkenfeld, David M. Quinn, Catherine Aiken, Jose Miguel Cabezas Navarro, Neil Lund, Egle Murauskaite, and Diana Partridge.

Citation:

For any use of the dataset or paper, please cite:

Douglass, Rex W., Thomas Leo Scherer, J. Andrés Gannon, Erik Gartzke, Jon Lindsay, Shannon Carcelli, Jonathan Wiklenfeld, David M. Quinn, Catherine Aiken, Jose Miguel Cabezas Navarro, Neil Lund, Egle Murauskaite, and Diana Partridge. 2022. “Introducing the ICBe Dataset: Very High Recall and Precision Event Extraction from Narratives about International Crises.” arXiv:2202.07081 [cs, stat]. http://arxiv.org/abs/2202.07081.

Replication Code and Analysis

A description of the file and folders in the repository used to create the datasets, tables, figures.

data preparation

replication_corpus

  • download_and_clean
    • creates a succinct rds of the crisis narratives: .replication_corpus/data/out/icb_corpus_V1.0_May_16_2022

replication_data

  • 01_compile_saves_and_align
    • compiles the original coding files into ./replication_data/in/icb_long_spans.Rds. The original coding files are not on the public repository. Public users will load icb_long_spans.Rds directly.
    • aligns codings from multiple GUI versions on similar source sentences
  • 02_format_and_clean
    • applies cleaning dictionaries to create ./replication_data/out/ICBe_V1.1_long_clean.Rds.
  • 03_aggregation
    • applies aggregation algorithm to create ./replication_data/out/ICBe_V1.1_long_agreement.Rds, ICBe_V1.1_long.Rds, ./replication_data/out/ICBe_V1.1_events_agreed.Rds, ./replication_data/out/ICBe_V1.1_events_agreed_long.Rds.
  • 04_validation
    • applies iconography to crises and events to create ICBe_V1.1_crises_markdown.Rds and ICBe_V1.1_events_agreed_markdown.Rds

replication_paper

The figures are created in Rmd file for the paper (./replication_paper/pnas_draft/ICBe_pnas_submission_rmd.Rmd). In some cases they have been transformed to other formats via GNU Image Manipulation Program.

  • case_study_cuban_precision.png
  • recall_cuban_and_crimea_andcounts.png
  • p_precision_combined.png
    • combines metro maps of the two case studies using ICBe_V1.1_events_agreed.Rds
  • p_semantic_embeddings_dendro.png
    • plot of semantic embeddings of ICBe_V1.1_events_agreed_markdown.Rds
  • p_precision_icews.png
    • mapping of icews using ./replication_paper/data/out/icews_clean_471_lowest.tsv (created in ./replication_paper/pnas_draft/appendix.Rmd)

Data inputs:

ICBe preparation

  • Cleaning dictionaries: used to clean raw codings for actors, actions, locations, and dates /replication_paper/data/in/icb_manual_recording_master_sheet.xlsx
  • Lit review and tree/leaf codebook: replication_data/in/icbe_litreview_trees_sentences.xlsx and /replication_paper/data/in/icbe_litreview_trees_sentences.xlsx
  • Case study tables: /replication_paper/data/in/CaseStudies.xlsx

External datasets

Appendix

The v1.1 Online Appendix can be downloaded here.

License and DOI

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

CC BY-NC-SA 4.0

DOI

About

License:Creative Commons Attribution 4.0 International


Languages

Language:TeX 90.6%Language:HTML 6.5%Language:JavaScript 1.3%Language:PostScript 1.2%Language:Roff 0.4%