Lukac Martin
Koios, London, UK.
Sci Rep. 2024 Dec 3;14(1):30149. doi: 10.1038/s41598-024-81047-0.
This study introduces a novel method for predicting the Big Five personality traits through the analysis of speech samples, advancing the field of computational personality assessment. We collected data from 2045 participants who completed a self-reported Big Five personality questionnaire and provided free-form speech samples by introducing themselves without constraints on content. Using pre-trained convolutional neural networks and transformer-based models, we extracted embeddings representing both acoustic features (e.g., tone, pitch, rhythm) and linguistic content from the speech samples. These embeddings were combined and input into gradient boosted tree models to predict personality traits. Our results indicate that personality traits can be effectively predicted from speech, with correlation coefficients between predicted scores and self-reported scores ranging from 0.26 (extraversion) to 0.39 (neuroticism), and from 0.39 to 0.60 for disattenuated correlations. Intraclass correlations show moderate to high consistency in our model's predictions. This approach captures the subtle ways in which personality traits are expressed through both how people speak and what they say. Our findings underscore the potential of voice-based assessments as a complementary tool in psychological research, providing new insights into the connection between speech and personality.
本研究介绍了一种通过分析语音样本预测大五人格特质的新方法,推动了计算人格评估领域的发展。我们从2045名参与者那里收集了数据,这些参与者完成了一份自我报告的大五人格问卷,并通过自我介绍提供了自由形式的语音样本,内容不受限制。我们使用预训练的卷积神经网络和基于Transformer的模型,从语音样本中提取了代表声学特征(如语调、音高、节奏)和语言内容的嵌入。这些嵌入被组合起来并输入到梯度提升树模型中以预测人格特质。我们的结果表明,可以从语音中有效地预测人格特质,预测分数与自我报告分数之间的相关系数范围为0.26(外向性)至0.39(神经质),去衰减相关系数范围为0.39至0.60。组内相关显示我们模型的预测具有中度到高度的一致性。这种方法捕捉了人格特质通过人们说话方式和说话内容来表达的微妙方式。我们的研究结果强调了基于语音的评估作为心理学研究中一种补充工具的潜力,为语音与人格之间的联系提供了新的见解。