Jang Jihee, Yoon Seowon, Son Gaeun, Kang Minjung, Choeh Joon Yeon, Choi Kee-Hong
School of Psychology, Korea University, Seoul, South Korea.
Department of Software, Sejong University, Seoul, South Korea.
Front Psychol. 2022 Apr 7;13:865541. doi: 10.3389/fpsyg.2022.865541. eCollection 2022.
Self-report multiple choice questionnaires have been widely utilized to quantitatively measure one's personality and psychological constructs. Despite several strengths (e.g., brevity and utility), self-report multiple choice questionnaires have considerable limitations in nature. With the rise of machine learning (ML) and Natural language processing (NLP), researchers in the field of psychology are widely adopting NLP to assess psychological construct to predict human behaviors. However, there is a lack of connections between the work being performed in computer science and that of psychology due to small data sets and unvalidated modeling practices.
The current article introduces the study method and procedure of phase II which includes the interview questions for the five-factor model (FFM) of personality developed in phase I. This study aims to develop the interview (semi-structured) and open-ended questions for the FFM-based personality assessments, specifically designed with experts in the field of clinical and personality psychology (phase 1), and to collect the personality-related text data using the interview questions and self-report measures on personality and psychological distress (phase 2). The purpose of the study includes examining the relationship between natural language data obtained from the interview questions, measuring the FFM personality constructs, and psychological distress to demonstrate the validity of the natural language-based personality prediction.
Phase I (pilot) study was conducted to fifty-nine native Korean adults to acquire the personality-related text data from the interview (semi-structured) and open-ended questions based on the FFM of personality. The interview questions were revised and finalized with the feedback from the external expert committee, consisting of personality and clinical psychologists. Based on the established interview questions, a total of 300 Korean adults will be recruited using a convenience sampling method online survey. The text data collected from interviews will be analyzed using the natural language processing. The results of the online survey including demographic data, depression, anxiety, and personality inventories will be analyzed together in the model to predict individuals' FFM of personality and the level of psychological distress (phase 2).
自陈式多项选择题问卷已被广泛用于定量测量一个人的人格和心理结构。尽管有几个优点(如简洁性和实用性),但自陈式多项选择题问卷本质上有相当大的局限性。随着机器学习(ML)和自然语言处理(NLP)的兴起,心理学领域的研究人员广泛采用NLP来评估心理结构以预测人类行为。然而,由于数据集小和建模实践未经验证,计算机科学领域的工作与心理学领域的工作之间缺乏联系。
本文介绍了第二阶段的研究方法和程序,其中包括第一阶段开发的人格五因素模型(FFM)的访谈问题。本研究旨在为基于FFM的人格评估开发访谈(半结构化)和开放式问题,这些问题是与临床和人格心理学领域的专家专门设计的(第一阶段),并使用访谈问题以及关于人格和心理困扰的自陈式测量方法收集与人格相关的文本数据(第二阶段)。该研究的目的包括检查从访谈问题中获得的自然语言数据、测量FFM人格结构与心理困扰之间的关系,以证明基于自然语言的人格预测的有效性。
第一阶段(试点)研究对59名韩国本土成年人进行,以从基于人格FFM的访谈(半结构化)和开放式问题中获取与人格相关的文本数据。访谈问题根据由人格和临床心理学家组成的外部专家委员会的反馈进行了修订和最终确定。基于已确定的访谈问题,将采用便利抽样方法在网上调查中招募总共300名韩国成年人。从访谈中收集的文本数据将使用自然语言处理进行分析。在线调查的结果,包括人口统计学数据、抑郁、焦虑和人格量表,将在模型中一起分析,以预测个体的人格FFM和心理困扰水平(第二阶段)。