School of Psychology, Georgia Institute of Technology, Atlanta, GA 30332.
Proc Natl Acad Sci U S A. 2024 Nov 26;121(48):e2410487121. doi: 10.1073/pnas.2410487121. Epub 2024 Nov 22.
The Bayesian confidence hypothesis (BCH), which postulates that confidence reflects the posterior probability that a decision is correct, is currently the most prominent theory of confidence. Although several recent studies have found evidence against it in the context of relatively complex tasks, BCH remains dominant for simpler tasks. The major alternative to BCH is the confidence in raw evidence space (CRES) hypothesis, according to which confidence is based directly on the raw sensory evidence without explicit probability computations. Here, we tested these competing hypotheses in the context of perceptual tasks that are assumed to induce Gaussian evidence distributions. We show that providing information about task difficulty gives rise to a basic behavioral signature that distinguishes BCH from CRES models even for simple 2-choice tasks. We examined this signature in three experiments and found that all experiments exhibited behavioral signatures in line with CRES computations but contrary to BCH ones. We further performed an extensive comparison of 16 models that implemented either BCH or CRES confidence computations and systematically differed in their auxiliary assumptions. These model comparisons provided overwhelming support for the CRES models over their BCH counterparts across all model variants and across all three experiments. These observations challenge BCH and instead suggest that humans may make confidence judgments by placing criteria directly in the space of the sensory evidence.
贝叶斯置信度假设(BCH)认为,置信度反映了决策正确的后验概率,它是目前最主要的置信度理论。尽管最近的几项研究在相对复杂的任务背景下发现了它的证据,但在简单的任务中,BCH 仍然占主导地位。BCH 的主要替代理论是原始证据空间置信度(CRES)假设,根据该假设,置信度是直接基于原始感觉证据得出的,而不需要进行明确的概率计算。在这里,我们在被认为会产生高斯证据分布的感知任务背景下测试了这些相互竞争的假设。我们表明,提供有关任务难度的信息会产生一个基本的行为特征,即使对于简单的 2 选择任务,也可以将 BCH 与 CRES 模型区分开来。我们在三个实验中检验了这个特征,并发现所有实验都表现出与 CRES 计算相符但与 BCH 计算相反的行为特征。我们进一步对 16 个模型进行了广泛的比较,这些模型分别实现了 BCH 或 CRES 置信度计算,并且在辅助假设方面存在系统性差异。这些模型比较在所有模型变体和所有三个实验中都强烈支持 CRES 模型,而不是 BCH 模型。这些观察结果对 BCH 提出了挑战,而是表明人类可能通过将标准直接放置在感觉证据空间中来做出置信度判断。