Large language models and humans converge in judging public figures' personalities.

Cao Xubo, Kosinski Michal

Graduate School of Business, Stanford University, Stanford, CA 94305, USA.

PNAS Nexus. 2024 Sep 19;3(10):pgae418. doi: 10.1093/pnasnexus/pgae418. eCollection 2024 Oct.

ChatGPT-4 and 600 human raters evaluated 226 public figures' personalities using the Ten-Item Personality Inventory. The correlation between ChatGPT-4 and aggregate human ratings ranged from r = 0.76 to 0.87, outperforming the models specifically trained to make such predictions. Notably, the model was not provided with any training data or feedback on its performance. We discuss the potential explanations and practical implications of ChatGPT-4's ability to mimic human responses accurately.
