Abdurahman Suhaib, Atari Mohammad, Karimi-Malekabadi Farzan, Xue Mona J, Trager Jackson, Park Peter S, Golazizian Preni, Omrani Ali, Dehghani Morteza
Department of Psychology, University of Southern California, Los Angeles, CA 90089, USA.
Brain and Creativity Institute, University of Southern California, Los Angeles, CA 90089, USA.
PNAS Nexus. 2024 Jul 16;3(7):pgae245. doi: 10.1093/pnasnexus/pgae245. eCollection 2024 Jul.
The emergence of large language models (LLMs) has sparked considerable interest in their potential application in psychological research, mainly as a model of the human psyche or as a general text-analysis tool. However, the trend of using LLMs without sufficient attention to their limitations and risks, which we rhetorically refer to as "GPTology", can be detrimental given the easy access to models such as ChatGPT. Beyond existing general guidelines, we investigate the current limitations, ethical implications, and potential of LLMs specifically for psychological research, and show their concrete impact in various empirical studies. Our results highlight the importance of recognizing global psychological diversity, cautioning against treating LLMs (especially in zero-shot settings) as universal solutions for text analysis, and developing transparent, open methods to address LLMs' opaque nature for reliable, reproducible, and robust inference from AI-generated data. Acknowledging LLMs' utility for task automation, such as text annotation, or to expand our understanding of human psychology, we argue for diversifying human samples and expanding psychology's methodological toolbox to promote an inclusive, generalizable science, countering homogenization, and over-reliance on LLMs.
大语言模型(LLMs)的出现引发了人们对其在心理学研究中潜在应用的浓厚兴趣,主要是将其作为人类心理的模型或通用文本分析工具。然而,在使用大语言模型时,如果没有充分关注其局限性和风险(我们将这种现象戏称为“GPT学”),鉴于ChatGPT等模型易于获取,可能会产生不利影响。除了现有的一般指导方针外,我们专门针对心理学研究,探讨了大语言模型当前的局限性、伦理影响和潜力,并展示了它们在各种实证研究中的具体影响。我们的结果强调了认识全球心理多样性的重要性,告诫不要将大语言模型(特别是在零样本设置中)视为文本分析的通用解决方案,并开发透明、开放的方法来应对大语言模型的不透明性,以便从人工智能生成的数据中进行可靠、可重复和稳健的推断。认识到大语言模型在任务自动化(如文本注释)方面的效用,或有助于扩展我们对人类心理的理解,我们主张使人类样本多样化,并扩大心理学的方法工具箱,以促进一门包容、可推广的科学,对抗同质化和对大语言模型的过度依赖。