• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

成为一个聊天机器人是什么样的?GPT-4眼中的世界。

What is it like to be a bot? The world according to GPT-4.

作者信息

Lloyd Dan

机构信息

Trinity College, Hartford, CT, United States.

出版信息

Front Psychol. 2024 Aug 7;15:1292675. doi: 10.3389/fpsyg.2024.1292675. eCollection 2024.

DOI:10.3389/fpsyg.2024.1292675
PMID:39176045
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11339530/
Abstract

The recent explosion of Large Language Models (LLMs) has provoked lively debate about "emergent" properties of the models, including intelligence, insight, creativity, and meaning. These debates are rocky for two main reasons: The emergent properties sought are not well-defined; and the grounds for their dismissal often rest on a fallacious appeal to extraneous factors, like the LLM training regime, or fallacious assumptions about processes within the model. The latter issue is a particular roadblock for LLMs because their internal processes are largely unknown - they are colossal black boxes. In this paper, I try to cut through these problems by, first, identifying one salient feature shared by systems we regard as intelligent/conscious/sentient/etc., namely, their responsiveness to environmental conditions that may not be near in space and time. They engage with subjective worlds ("s-worlds") which may or may not conform to the actual environment. Observers can infer s-worlds from behavior alone, enabling hypotheses about perception and cognition that do not require evidence from the internal operations of the systems in question. The reconstruction of s-worlds offers a framework for comparing cognition across species, affording new leverage on the possible sentience of LLMs. Here, we examine one prominent LLM, OpenAI's GPT-4. Inquiry into the emergence of a complex subjective world is facilitated with philosophical phenomenology and cognitive ethology, examining the pattern of errors made by GPT-4 and proposing their origin in the absence of an analogue of the human subjective awareness of time. This deficit suggests that GPT-4 ultimately lacks a capacity to construct a stable perceptual world; the temporal vacuum undermines any capacity for GPT-4 to construct a consistent, continuously updated, model of its environment. Accordingly, none of GPT-4's statements are epistemically secure. Because the anthropomorphic illusion is so strong, I conclude by suggesting that GPT-4 works with its users to construct improvised works of fiction.

摘要

近期大语言模型(LLMs)的爆发引发了关于模型“涌现”特性的热烈讨论,这些特性包括智能、洞察力、创造力和意义。这些讨论存在两大难点:一是所探寻的涌现特性定义不明确;二是对这些特性的否定往往基于对外部因素(如大语言模型的训练机制)的错误诉诸,或者基于对模型内部过程的错误假设。后一个问题对大语言模型来说是个特别的障碍,因为它们的内部过程很大程度上不为人知——它们是巨大的黑箱。在本文中,我试图解决这些问题,首先识别我们认为具有智能/意识/感知等的系统所共有的一个显著特征,即它们对时空上可能并不临近的环境条件的响应能力。它们与主观世界(“s - 世界”)互动,这些主观世界可能与实际环境相符,也可能不符。观察者仅从行为就能推断出s - 世界,从而提出关于感知和认知的假设,而无需来自相关系统内部运作的证据。s - 世界的重建提供了一个跨物种比较认知的框架,为探讨大语言模型可能的感知能力提供了新的视角。在此,我们研究一个著名的大语言模型,即OpenAI的GPT - 4。借助哲学现象学和认知行为学来探究复杂主观世界的涌现,研究GPT - 4所犯错误的模式,并提出这些错误源于缺乏人类对时间的主观意识的类似物。这种不足表明GPT - 4最终缺乏构建稳定感知世界的能力;时间真空破坏了GPT - 4构建其环境的一致、持续更新模型的任何能力。因此,GPT - 4的任何陈述在认知上都不可靠。由于拟人化错觉非常强烈,我最后建议GPT - 4与用户合作创作即兴虚构作品。

相似文献

1
What is it like to be a bot? The world according to GPT-4.成为一个聊天机器人是什么样的?GPT-4眼中的世界。
Front Psychol. 2024 Aug 7;15:1292675. doi: 10.3389/fpsyg.2024.1292675. eCollection 2024.
2
Quality of Answers of Generative Large Language Models Versus Peer Users for Interpreting Laboratory Test Results for Lay Patients: Evaluation Study.生成式大语言模型与同行用户对解释非专业患者实验室检测结果的答案质量比较:评估研究。
J Med Internet Res. 2024 Apr 17;26:e56655. doi: 10.2196/56655.
3
Comparing the Performance of Popular Large Language Models on the National Board of Medical Examiners Sample Questions.比较流行的大语言模型在国家医学考试委员会样题上的表现。
Cureus. 2024 Mar 11;16(3):e55991. doi: 10.7759/cureus.55991. eCollection 2024 Mar.
4
Large Language Models for Therapy Recommendations Across 3 Clinical Specialties: Comparative Study.大型语言模型在 3 个临床专业领域的治疗推荐中的应用:比较研究。
J Med Internet Res. 2023 Oct 30;25:e49324. doi: 10.2196/49324.
5
Large Language Models Versus Expert Clinicians in Crisis Prediction Among Telemental Health Patients: Comparative Study.大语言模型与专家临床医生在远程心理健康患者危机预测中的比较研究。
JMIR Ment Health. 2024 Aug 2;11:e58129. doi: 10.2196/58129.
6
Assessing GPT-4's Performance in Delivering Medical Advice: Comparative Analysis With Human Experts.评估 GPT-4 提供医疗建议的表现:与人类专家的比较分析。
JMIR Med Educ. 2024 Jul 8;10:e51282. doi: 10.2196/51282.
7
Improving accuracy of GPT-3/4 results on biomedical data using a retrieval-augmented language model.使用检索增强语言模型提高GPT-3/4在生物医学数据上的结果准确性。
PLOS Digit Health. 2024 Aug 21;3(8):e0000568. doi: 10.1371/journal.pdig.0000568. eCollection 2024 Aug.
8
Artificial Intelligence for Anesthesiology Board-Style Examination Questions: Role of Large Language Models.人工智能在麻醉学 board 式考试问题中的应用:大语言模型的作用。
J Cardiothorac Vasc Anesth. 2024 May;38(5):1251-1259. doi: 10.1053/j.jvca.2024.01.032. Epub 2024 Feb 1.
9
Peer review of GPT-4 technical report and systems card.GPT-4技术报告和系统卡片的同行评审。
PLOS Digit Health. 2024 Jan 18;3(1):e0000417. doi: 10.1371/journal.pdig.0000417. eCollection 2024 Jan.
10
Quality of Answers of Generative Large Language Models vs Peer Patients for Interpreting Lab Test Results for Lay Patients: Evaluation Study.生成式大语言模型与同侪患者为非专业患者解读实验室检查结果的答案质量:评估研究
ArXiv. 2024 Jan 23:arXiv:2402.01693v1.

引用本文的文献

1
Digital Doppelgängers and Lifespan Extension: What Matters?数字分身与寿命延长:关键何在?
Am J Bioeth. 2025 Feb;25(2):95-110. doi: 10.1080/15265161.2024.2416133. Epub 2024 Nov 14.

本文引用的文献

1
An AI Mystery: Researchers are struggling to understand how artificial-intelligence models know things no one told them.
Sci Am. 2023 Sep 1;329(2):58. doi: 10.1038/scientificamerican0923-58.
2
Dissociating language and thought in large language models.大语言模型中的语言与思维分离。
Trends Cogn Sci. 2024 Jun;28(6):517-540. doi: 10.1016/j.tics.2024.01.011. Epub 2024 Mar 19.
3
Stable Consciousness? The "Hard Problem" Historically Reconstructed and in Perspective of Neurophenomenological Research on Meditation.意识稳定?“难题”的历史重构及冥想的神经现象学研究视角
Front Psychol. 2022 May 26;13:914322. doi: 10.3389/fpsyg.2022.914322. eCollection 2022.
4
How neuroscience will change our view on consciousness.神经科学将如何改变我们对意识的看法。
Cogn Neurosci. 2010 Sep;1(3):204-20. doi: 10.1080/17588921003731586. Epub 2010 Apr 15.
5
Constructive memory: past and future.建构性记忆:过去与未来。
Dialogues Clin Neurosci. 2012 Mar;14(1):7-18. doi: 10.31887/DCNS.2012.14.1/dschacter.
6
Global workspace theory of consciousness: toward a cognitive neuroscience of human experience.意识的全局工作空间理论:迈向人类体验的认知神经科学
Prog Brain Res. 2005;150:45-53. doi: 10.1016/S0079-6123(05)50004-9.