Bottom-up and top-down: Predicting personality with psycholinguistic and language model features

Word cloud visualization of the most important psycholinguistic features driving personality trait prediction.


State-of-the-art personality prediction with text datamostly relies on bottom up, automated feature generation as part of the deep learning process. More traditional models rely onhand-crafted, theory-based text-feature categories. We propose a novel deep learning-based model which integrates traditional psycholinguistic features with language model embeddings to predict personality from the Essays dataset for Big-Five and Kaggle dataset for MBTI. With this approach we achieve state-of-the-art model performance. Additionally, we use interpretable machine learning to visualize and quantify the impact of various language features in the respective personality prediction models. We conclude with a discussion on the potential this work has for computational modeling and psychological science alike.

Amirmohammad Kazemeini
Amirmohammad Kazemeini
Artificial Intelligence Intern

I am a Natural Language Processing intern who is passionate about psychology and social science.

comments powered by Disqus