Science of Science Summer School (S4) 2022 #project-dark-side
Scrape retracted and up to 200 journal-year-matched-unretracted json records from OpenAlex
Combine retracted and (as run, up to 20) unretracted (as run, n = 113,937) into single file
Build core_df, the article-level dataframe with retracted and control measures
Scrape and calculate (simultaneously -- too large to cache) Funk & Owen-Smith-style disruptiveness measure
Take in raw shibayama novelty results and output the bare minimum of work ID, abstract, title scores
Merge together the core dataframe with outcomes (disruptiveness and novelty) to create analysis dataset
Do some analysis
output from reduce_shibayama_results.ipynb
output from 3_core_df.ipynb
output from 5_analysis_df.ipynb