Applying Decision Tree to Investigate How Personality Traits Influence Rational Decision-Making Behaviors
- Author: Qinyi Chen, Behavioral Science/Neuroscience, Class of 2026, Duke Kunshan University
- Instructor: Professor Luyao Zhang, Duke Kunshan University
- Disclaimer: This project is a submission to the Final Project for STATS201 Introduction to Machine Learning for Social Science, 2023 Autumn Term (Seven Week - Second) instructed by Prof. Luyao Zhang at Duke Kunshan University.
- Acknowledgments: I extend my sincere gratitude to Professor Luyao Zhang for her invaluable supervision and insightful teaching during the STATS 201 machine learning for social science course. Her guidance and support were instrumental in shaping the development of this project. Additionally, I would like to acknowledge the helpful suggestions made by my peers in this class such as Yifei Luo in broadening my ideas, as well as my friends not in this course. Sincere gratitude to my roommate Yihan Wang and my friend Juntang Wang for helping me query datasets that are suitable for research. Moreover, thanks to my friend Chinyu Chen for helping me with the bugged code.
Applying Decision Tree to Investigate How Personality Traits Influence Rational Decision-Making Behaviors Flow Chart Created by Marksmap
In behavioral science, rationality describes a person's ability to make proper decisions by utilizing optimal outcomes, often by mitigating various cognitive biases. The study "Cognitive Biases and Mindfulness" (Maymin & Langer, 2021) delved into the relationship between mindfulness and rationality. It surveyed participants on 22 cognitive biases across three conditions: Mindful, Mindequal, and Mindless, alongside conducting the Langer Mindfulness Survey (LMS) to measure personality traits influencing rational decision-making. The study concluded that a mindful state could enhance performance by reducing cognitive biases and making rational decisions.
How do different personality traits correlate with rational decision making behaviors?
This research will be applicable in personal development, educational settings, and professional environments where decision-making and cognitive bias mitigation are critical.
Model Building The study utilizes Decision Tree models to analyze the data. The X variable includes multiple personality traits, and the Y variable includes different aspects of rationality indexes. Under this circumstance, the Decision Tree is most suitable for modeling two variables containing multi-dimensional input. Data Visualization The correlation between total score of rationality and each personality trait was calculated. Feature importance plots are employed to visually explore the relationships between personality traits and rational decision-making capacities.
- Development of a Machine Learning Model of Rationality: By integrating personality traits and rational decision-making metrics into a predictive model, this research contributes to the quantitative understanding of human behavior in decision sciences.
- Exploration of Correlations: This project seeks to uncover potential correlations between rationality and various personality traits, offering a deeper understanding of how inherent characteristics influence decision-making processes.
- Practical Recommendations for Personal and Professional Growth: The findings could provide actionable insights for individuals to adapt certain personality traits in work settings or other critical situations, fostering self-improvement and enhancing decision-making efficacy.
Maymin, Philip Z., and Ellen J. Langer. “Cognitive Biases and Mindfulness.” Humanities and Social Sciences Communications 8, no. 1 (February 3, 2021). https://doi.org/10.1057/s41599-021-00712-1.
Research Question Formulation
- Objective: The primary objective of this research is to explore and identify potential correlations between various personality traits and cognitive biases using machine learning techniques.
- Significance: This question is crucial as it could provide insights into how intrinsic personality factors influence cognitive biases, potentially informing psychological and behavioral interventions.
Operational Measures
- Variables
- Dependent (Y): Rationality indexes (evaluated by the ability to avoid multiple biases).
- Independent (X): Personality traits measured by a standardized personality inventory.
- Data Type: The dataset is cross-sectional, gathered from a series of assessments across a diverse population at a single point in time.
Hypothesis Development
- Prediction Hypothesis: It's hypothesized that certain personality traits such as openness and neuroticism are predictive of the extent to which an individual exhibits specific cognitive biases.
- Justification: Previous research suggests that personality traits can influence decision-making processes and vulnerability to cognitive biases.
- Machine Learning Algorithm Selection: Decision Trees and Random Forests were selected due to their efficacy in handling non-linear relationships and interactions between multiple predictors, which is typical in psychological data.
Model Development
- Data Processing: The data were cleaned for missing values and outliers, and variables were normalized to ensure uniform scale. Feature engineering was conducted to create interaction terms that might reveal complex associations.
Results Presentation
- Training and Testing: The model was trained on 70% of the data with the remaining 30% used for testing. Performance metrics and confusion matrices were used to evaluate outcomes.
- Data Visualization: Results were visualized using ROC curves, precision-recall curves, and feature importance plots to illustrate the contributions of different variables.
Model Evaluation
- Evaluation Criteria: The models were evaluated based on accuracy, precision, recall, and the area under the ROC curve (AUC).
- Iterative Improvement: Techniques such as hyperparameter tuning and cross-validation were employed to refine the models and improve their predictive accuracy.
The data was found from Wolfram Data Repository-Maymin--Langer-2021-Cognitive-Biases-and-Mindfulness
Data dictionary
Table 1:Rationality Index
Table 2: Personality Traits
Variable Name | Inquiry Question | Max | Min | Mean | STD |
---|---|---|---|---|---|
Less_creative | I generate few novel ideas. | 7 | 0 | 3.541284 | 1.941420 |
Novelty_producing | I make many novel contributions. | 7 | 1 | 4.275229 | 1.626582 |
Engagement | I seldom notice what other people are up to. | 7 | 1 | 2.853211 | 1.626321 |
Low_novelty_seeking | I avoid thought-provoking conversations. | 7 | 1 | 2.293578 | 1.523293 |
Creativity | I am very creative. | 7 | 1 | 4.495413 | 1.636527 |
Curiosity | I am very curious. | 7 | 1 | 5.431193 | 1.529748 |
Novelty_methods | I try to think of new ways of doing things. | 7 | 1 | 5.348624 | 1.511537 |
Agility | I am rarely aware of changes. | 7 | 1 | 2.311927 | 1.274372 |
Intellectual_challenge | I like to be challenged intellectually. | 7 | 1 | 5.550459 | 1.443287 |
Creative_producing | I find it easy to create new and effective ideas. | 7 | 1 | 4.587156 | 1.717077 |
Insight_of_environment | I am rarely alert to new developments. | 7 | 1 | 2.541284 | 1.384652 |
Aspiration_to_knowledge | I like to figure out how things work. | 7 | 1 | 5.394495 | 1.509343 |
Conservative_thinking | I am not an original thinker. | 7 | 1 | 2.889908 | 1.657416 |
Model Accuracy: 0.5625
On average, the model correctly predicts approximately 56.25% of the labels across all the samples in the test set. This accuracy is lower than expected but still acceptable.
The lower prediction accuracy can be possibly due to the complexity of both X and Y variables. In the personality traits, there are often 2 or 3 questions inquiring about the same personality traits, yet the answer made by participants still changes slightly. The explanation of this phenomenon is some mis-interpreted self-evaluation, and that interferes with the modeling process.
Additionally, the low accuracy can also be attributed to the raw data processing period. Since it is not appropriate to clean all the NaNs in this dataset, which will return a relatively tiny database, I dropped some columns that contain too many NaNs in both rationality and personality. Additionally, I artificially filled some still-existing NaNs with 0 (the scale leans to less rationality). This may lead to an underestimation of the overall rationality. Therefore, if the research can be conducted again with completed database, the accuracy of the model should increase.
Correlation:
- Rationality and Personality
Strong Negative Correlation: total_rationality_score has a strong negative correlation with Less_creative (r = -0.10), and Low_novelty_seeking (r = -0.19), suggesting that higher rationality is associated with being more creative and seeking novelty. Strong Positive Correlation: total_rationality_score has a relatively strong positive correlation with Curiosity and willingness to take on new intellectual challenges Intellectual_challenge, which indicates that curious and intellectually active people tend to perform more rationally.
- Within Personality Traits
Strong Positive Correlations: There are several variables that show strong positive correlations with each other, for instance: Creativity and Curiosity (r = 0.53) Creativity and Novelty_methods (r = 0.51) Intellectual_challenge and Aspiration_to_knowledge (r = 0.64) These strong positive correlations indicate that as one trait increases, the other does too, for example, those who enjoy intellectual challenges also aspire to gain knowledge.
- Data Source: Wolfram Data Repository-Maymin--Langer-2021-Cognitive-Biases-and-Mindfulness
- Code Source: Final_Project_notebook
- Article Sources:Maymin, Philip Z., and Ellen J. Langer. “Cognitive Biases and Mindfulness.” Humanities and Social Sciences Communications 8, no. 1 (February 3, 2021). https://doi.org/10.1057/s41599-021-00712-1.