Shardlow Matthew, Przybyła Piotr
Department of Computing and Mathematics, Manchester Metropolitan University, Manchester, United Kingdom.
LaSTUS, Universitat Pompeu Fabra, Barcelona, Spain.
PLoS One. 2024 Dec 4;19(12):e0307521. doi: 10.1371/journal.pone.0307521. eCollection 2024.
This work is intended as a voice in the discussion over previous claims that a pretrained large language model (LLM) based on the Transformer model architecture can be sentient. Such claims have been made concerning the LaMDA model and also concerning the current wave of LLM-powered chatbots, such as ChatGPT. This claim, if confirmed, would have serious ramifications in the Natural Language Processing (NLP) community due to the widespread use of similar models. However, here we take the position that such a large language model cannot be conscious, and that LaMDA in particular exhibits no advances over other similar models that would qualify it as sentient. We justify this by analysing the Transformer architecture through the lens of the Integrated Information Theory of consciousness. We see the claims of sentience as part of a wider tendency to use anthropomorphic language in NLP reporting. Regardless of the veracity of the claims, we consider this an opportune moment to take stock of progress in language modelling and to consider the ethical implications of the task. To make this work helpful for readers outside the NLP community, we also present the necessary background in language modelling.