essays-dataset lstm nlp svm text-classification word2vec

Personality Trait Classification on Essays

Lucija Arambašić*, Miroslav Bićanić*, Frano Rajič*

Report

Automatically classifying a person's personality from a piece of text written by that person is an inherently difficult task, and the choice of dataset can make it harder still. In this work, we explore the classification performance of many different machine learning models with various feature combinations on a dataset of stream-of-consciousness essays written by students. Despite achieving very good performance, we argue that such a dataset may not be ideal for personality trait classification.

About

"Essays are a Fickle Thing", a project done as part of the "ID222452 Text Analysis and Retrieval" course by Prof. Jan Šnajder at UniZG-FER

https://www.fer.unizg.hr/_download/repository/TAR-2021-ProjectReports.pdf#page=8


Languages

Python 65.4%, TeX 34.2%, Shell 0.4%