The datasets are collected from Reddit. The data collection and pre-processing details can be found in the paper in the next section.
Dataset Name | Collection Period | #Train/#Test/#Validation |
---|---|---|
Reddit-MH-2021 | Jan 2021 - Sep 2021 | 499858 / 166620 / 166619 |
Reddit-LS-2021 | Jan 2021 - Sep 2021 | 375783 / 125261 / 125261 |
Kaggle link to download the dataset
The dictionaries are constructed using professional assessment tools and post-verified by a psychology researcher.
Anxiety Dictionary: Generalized Anxiety Disorder Questionnaire (GAD-7)
Depression Dictionary: Beck's Depression Inventory (BDI)
If you would like to use the presented datasets or/and dictionary, you can cite this paper:
@INPROCEEDINGS{Xia:IJCNN:2024,
author={Xia Cui and Terry Hanley and Muj Choudhury and Tingting Mu},
booktitle={2024 International Joint Conference on Neural Networks (IJCNN)},
title={Data-Driven or Dataless? Detecting Indicators of Mental Health Difficulties and Negative Life Events in Financial Resilience Using Prompt-Based Learning },
month = "Jun",
year={2024 (In Press)},
address = "Yokohama, Japan",
volume={},
number={},
pages={},
keywords={},
doi={}}