Kjell Oscar N E, Kjell Katarina, Schwartz H Andrew
Psychology Department, Lund University, Sweden; Computer Science Department, Stony Brook University, United States.
Psychology Department, Lund University, Sweden.
Psychiatry Res. 2024 Mar;333:115667. doi: 10.1016/j.psychres.2023.115667. Epub 2023 Dec 10.
In this narrative review, we survey recent empirical evaluations of AI-based language assessments and present a case for the technology of large language models to be poised for changing standardized psychological assessment. Artificial intelligence has been undergoing a purported "paradigm shift" initiated by new machine learning models, large language models (e.g., BERT, LAMMA, and that behind ChatGPT). These models have led to unprecedented accuracy over most computerized language processing tasks, from web searches to automatic machine translation and question answering, while their dialogue-based forms, like ChatGPT have captured the interest of over a million users. The success of the large language model is mostly attributed to its capability to numerically represent words in their context, long a weakness of previous attempts to automate psychological assessment from language. While potential applications for automated therapy are beginning to be studied on the heels of chatGPT's success, here we present evidence that suggests, with thorough validation of targeted deployment scenarios, that AI's newest technology can move mental health assessment away from rating scales and to instead use how people naturally communicate, in language.
在这篇叙述性综述中,我们调查了近期基于人工智能的语言评估的实证研究,并提出了一个观点,即大语言模型技术有望改变标准化心理评估。人工智能一直在经历一场由新的机器学习模型——大语言模型(例如BERT、LAMMA以及ChatGPT背后的模型)引发的所谓“范式转变”。这些模型在大多数计算机化语言处理任务中都取得了前所未有的准确性,从网络搜索到自动机器翻译和问答,而它们基于对话的形式,如ChatGPT,已经吸引了超过百万用户的关注。大语言模型的成功主要归功于其在语境中以数字形式表示单词的能力,而这长期以来一直是以往从语言自动进行心理评估尝试的一个弱点。虽然在ChatGPT取得成功之后,自动治疗的潜在应用开始受到研究,但在此我们提供证据表明,经过对目标部署场景的全面验证,人工智能的这项最新技术能够使心理健康评估摆脱评级量表,转而利用人们自然的语言交流方式。