Hawkins Robert D, Tsvilodub Polina, Bergey Claire Augusta, Goodman Noah D, Franke Michael
Department of Linguistics, Stanford University, Stanford, CA 94305, USA.
Eberhard Karls Universitat Tubingen, Tubingen, Baden-Württemberg 72074, Germany.
Philos Trans R Soc Lond B Biol Sci. 2025 Aug 14;380(1932):20230505. doi: 10.1098/rstb.2023.0505.
People often provide answers that go beyond what a question literally asks, but it has been difficult to pin down what makes some answers more relevant than others. Here, we introduce Pragmatic Reasoning In Overinformative Responses to Polar Questions (PRIOR-PQ), a probabilistic cognitive model formalizing how people use theory of mind (ToM) to produce and interpret relevantly overinformative answers to yes-no questions. Specifically, PRIOR-PQ grounds the pragmatics of question answering in inferences about the underlying that motivated the questioner to ask the given question as opposed to a different question. We evaluate our probabilistic model against human answering behaviour elicited in three case studies of increasing complexity, demonstrating its ability to predict nuanced patterns of relevance better than existing models, including state-of-the-art large language models. We also show how the goal-sensitive reasoning instantiated in our probabilistic model motivates a novel chain-of-thought prompting method allowing language models to approach more human-like performance. This work illuminates the mechanistic role of ToM in the pragmatics of question-answer exchanges, bridging formal semantics, cognitive science and artificial intelligence. Our findings have implications for developing more socially grounded dialogue systems and highlight the importance of integrating explanatory cognitive models with machine learning approaches.This article is part of the theme issue 'At the heart of human communication: new views on the complex relationship between pragmatics and Theory of Mind'.
人们常常给出的答案超出了问题字面上的要求,但很难确定是什么使得一些答案比其他答案更切题。在此,我们引入了“二元问题过度信息性回答中的语用推理”(PRIOR-PQ),这是一种概率认知模型,它将人们如何运用心理理论(ToM)来生成和解释对是非问题的相关过度信息性回答进行了形式化。具体而言,PRIOR-PQ将问答的语用学建立在对潜在动机的推理之上,这种动机促使提问者提出给定的问题而非其他问题。我们在三个复杂度不断增加的案例研究中,根据人类的回答行为对我们的概率模型进行了评估,结果表明它比现有模型(包括最先进的大语言模型)更能预测细微的相关性模式。我们还展示了我们概率模型中实例化的目标敏感推理如何激发一种新颖的思维链提示方法,使语言模型能够接近更类人的表现。这项工作阐明了心理理论在问答交流语用学中的机制作用,架起了形式语义学、认知科学和人工智能之间的桥梁。我们的发现对开发更具社会基础的对话系统具有启示意义,并凸显了将解释性认知模型与机器学习方法相结合的重要性。本文是主题为“人类交流的核心:关于语用学与心理理论复杂关系的新观点”的一部分。