Creating a random set of references

Question

taleevenhuis opened this issue 6 months ago

Feature Request

Following the SAFE procedure (Boetje and Van de Schoot, 2023: https://osf.io/preprints/psyarxiv/c93gq), is it possible for the Screening Phase to create a random set of, say, the most recent 33 references, the middle 33 references and the oldest 33? This way the model in ASReview is trained not only on recent terms but also on older ones. If you use only the most recent references, the model will only be trained on new terminology or jargon. Example: PTSD is a recent term, but Shell Shock describes the same phenomenon in World War I.
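To make the request concrete, here is a minimal sketch of the selection described above. It is not ASReview code: the function name `sample_old_middle_new` and the `year` field are assumptions made for illustration, and it uses only the Python standard library. It sorts the records by year, takes the oldest `n`, the `n` around the middle, and the newest `n`, and then shuffles them so they are presented in random order.

```python
import random


def sample_old_middle_new(records, n=33, seed=None):
    """Return the n oldest, n middle, and n newest records, shuffled.

    `records` is a list of dicts with a numeric "year" key (a hypothetical
    layout chosen for this sketch, not an ASReview data format).
    """
    ordered = sorted(records, key=lambda r: r["year"])
    total = len(ordered)

    if total <= 3 * n:
        # Too few records to form three separate groups: use all of them.
        picked = list(ordered)
    else:
        # Start of the middle block, centred in the sorted list.
        mid_start = (total - n) // 2
        picked = (
            ordered[:n]
            + ordered[mid_start:mid_start + n]
            + ordered[-n:]
        )

    # Shuffle so the screener does not see the records grouped by age.
    rng = random.Random(seed)
    rng.shuffle(picked)
    return picked


if __name__ == "__main__":
    # Toy data: one record per year, 1900 to 2099.
    demo = [{"id": i, "year": 1900 + i} for i in range(200)]
    subset = sample_old_middle_new(demo, n=33, seed=42)
    print(len(subset), min(r["year"] for r in subset), max(r["year"] for r in subset))
```

Records with the same year keep their original relative order within the sort, so ties at the group boundaries are resolved by input order. Whether the three groups should be fixed slices like this, or random draws from the oldest, middle, and newest thirds, is a design choice the request leaves open.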

bethgr · Answer 1 · Tue Feb 13 2024 21:40:07 GMT+0800 (China Standard Time)

Interesting idea! I would have thought truly random screening would take care of this, since in theory the different naming conventions would then be picked up. Your idea does feel safer, though, as a guard against jargon bias, especially if a researcher is not aware of these terms ahead of time and so did not mark those articles as relevant in the initial training set.

Bmcoimbra · Answer 2 · Wed Feb 14 2024 20:15:39 GMT+0800 (China Standard Time)

@taleevenhuis has a point. Disorders indeed have their terminology revised across the years, whether to include new concepts or to avoid stigmatizing language. PTSD was once called shell shock, traumatic neurosis, combat fatigue, etc., and has recently encompassed terms like secondary traumatic stress and compassion fatigue. More recently, Hoarding Disorder came to be considered a disorder in its own right rather than solely a symptom of OCD. For studies on psychology and psychiatry, taleevenhuis' idea could be helpful.

Hmarck · Answer 3 · Thu Feb 15 2024 15:51:57 GMT+0800 (China Standard Time)

This might be a good idea, especially in fields where terminology changes significantly over time. However, I am curious about the relevance of articles that use older terms. One might question how significant these articles are and how applicable they are to current research.

Rens van de schoot · Answer 4 · Fri Apr 19 2024 19:05:39 GMT+0800 (China Standard Time)

The feature was introduced in asreview/asreview-datatools#41 and is now available in datatools.