Department of Computer Science, Stanford University, Stanford, CA 94305, USA; Social & Decision Sciences, Carnegie Mellon University, 5000 Forbes Avenue, Pittsburgh, PA 15213, USA.
Social & Decision Sciences, Carnegie Mellon University, 5000 Forbes Avenue, Pittsburgh, PA 15213, USA; Santa Fe Institute, 1399 Hyde Park Road, Santa Fe, NM 87501, USA.
Cognition. 2022 Aug;225:105120. doi: 10.1016/j.cognition.2022.105120. Epub 2022 Apr 8.
Mathematical proofs are both paradigms of certainty and some of the most explicitly-justified arguments that we have in the cultural record. Their very explicitness, however, leads to a paradox, because the probability of error grows exponentially as the argument expands. When a mathematician encounters a proof, how does she come to believe it? Here we show that, under a cognitively-plausible belief formation mechanism combining deductive and abductive reasoning, belief in mathematical arguments can undergo what we call an epistemic phase transition: a dramatic and rapidly-propagating jump from uncertainty to near-complete confidence at reasonable levels of claim-to-claim error rates. To show this, we analyze an unusual dataset of forty-eight machine-aided proofs from the formalized reasoning system Coq, including major theorems ranging from ancient to 21st Century mathematics, along with five hand-constructed cases including Euclid, Apollonius, Hernstein's Topics in Algebra, and Andrew Wiles's proof of Fermat's Last Theorem. Our results bear both on recent work in the history and philosophy of mathematics on how we understand proofs, and on a question, basic to cognitive science, of how we justify complex beliefs.
数学证明既是确定性的典范,也是我们在文化记录中所拥有的最明确论证之一。然而,它们的明确性导致了一个悖论,因为随着论证的扩展,错误的概率呈指数级增长。当数学家遇到一个证明时,她是如何相信它的呢?在这里,我们表明,在一种结合演绎推理和溯因推理的认知上合理的信念形成机制下,对数学论证的信念可以经历我们所谓的认知相变:从不确定性到近乎完全置信的急剧而迅速传播的跳跃,在合理的主张到主张错误率水平下。为了证明这一点,我们分析了来自形式化推理系统 Coq 的四十八个机器辅助证明的异常数据集,其中包括从古代到 21 世纪数学的主要定理,以及包括欧几里得、阿波罗尼斯、赫尔斯坦的《代数学论题》和安德鲁·怀尔斯的费马大定理证明在内的五个手工构建的案例。我们的结果既涉及到数学史和哲学中关于我们如何理解证明的最新工作,也涉及到认知科学中关于我们如何证明复杂信念的基本问题。