hdvos / Critical_Replication_ML_in_PA

Te reproduction code for the Replication Study

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Critical_Replication_ML_in_PA

The reproduction code for the Replication Study

Explanatory Memo regarding data files appended to the submission Small data problems in political research: a critical replication study This data folder contains 4 subfolders:

  1. Data: this folder contains all data that is needed to replicate the study. These are 2 .csv files (can be opened with among others Microsoft Excel). coded-tweet-data.csv contains the labeled tweets that serve as training set and test set to the machine learning model. agency_tweets_database.csv contains unlabeled tweets. N.B. Both files are identical to the files appended to Anastasopoulos and Whitford (2019).
  2. rmd_files: This folder contains 3 R-markdown files1. These files can be opened and run with R-studio2. Executing these files will replicate the entire study. If the data-files are kept in their folder after extraction of the zip, the script is able to find them without any alterations. Note that for the script to run properly, some extra R-libraries need to be installed, preferably the versions indicated in the R-markdown files.
  3. html_files: This folder contains the content of the rmd-files (including output) in HTML-format. These files can be opened with any web browser (firefox or google chrome are recommended.). These files contain no new information regarding the rmd-files, but are meant for people that are not interested in running the code themselves.
  4. Figures: this folder contains all the figures created with the scripts. When the rmd-files are run after extraction of the zip-file, all figures will be placed in this folder.

About

Te reproduction code for the Replication Study


Languages

Language:HTML 99.9%Language:Jupyter Notebook 0.1%