Wxyxixixi / MBTI-Personality-Prediction-using-Tweets

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

MBTI Personality Prediction Using Tweet History

About The Project

The aim of our project is to sort different people into their MBTI personality types using our machine learning classifier. For this purpose, we would be using various text samples available from their social media posts and accounts. As there is a large amount of data and genuine information available in a social media account as opposed to a single personality test, we think that the accuracy of our model would be much higher than that of a single personality test therefore we decided to work in this area. A successful implementation of such a model would demonstrate a strong connection between personality type and the language that we use in our daily life.

(back to top)

Tech Stack

This section should list any major frameworks/libraries used to bootstrap your project. Leave any add-ons/plugins for the acknowledgements section. Here are a few examples.

  • Python3
  • Natural Language Processing
  • Machine Learning

(back to top)

Requirements

  1. panda
  2. matplotlib
  3. numpy
  4. nltk
  5. sklearn
  6. keras
  7. wordcloud
  8. tweepy

Work Done

  • Analysis and Preprocessing of the training data downloaded from Kaggle.

  • Model trained on the following algorithms -
    1. Catboost
    2. Linear SVC
    3. Multinomial NB
    4. XGBoost
    5. KNN
    6. Decision Tree
    7. SGD
    8. SVC
      Linear SVC algorithm gave the best accuracy.

* Test Data Extraction from
  1. Forbes Most Significant Figures
  2. The World’s Most Powerful Women and People

Their MBTI Personality was gathered from Personality Database.Last 50 tweets were extracted.


* Test Data Visualization and Analysis

Test data preprocessing.Predictions made on test data using Linear SVC.


* Variation observed in the following personality types using their last 50 tweets:

Male Personalities

  1. Vladimir Putin
  2. Pope Francis
  3. Rupert Murdoch
  4. Lloyd Blankfein
  5. Jeff Bezos

Female Personalities

  1. Kamala Harris
  2. Jacinda Ardern
  3. Christine Lagarde
  4. Ursula von der Leyen
  5. Nancy Pelosi

(back to top)

Observations

*Jeff Bezos' tweets were greatly varying in feeling and thinking whereas Jacinda Arden's tweets were greatly varying in perception and judging.
*Vladimir Putin showed a consistent personality type in all his tweets whereas Kamala Harris’ mbti personality is not matching with personality type that she showed with her tweets.
*Also, we noticed that most of these significant figures showed a personality type INTP(Introversion Intuition Thinking Perception )through their tweets which is a little different than their original ones, which we believe could be a reason because of the formal language used in their tweets.

Acknowledgments

Use this space to list resources you find helpful and would like to give credit to. I've included a few of my favorites to kick things off!

(back to top)

About

License:MIT License


Languages

Language:Jupyter Notebook 100.0%