• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

视觉搜索的统计模板

Statistical templates for visual search.

作者信息

Ackermann John F, Landy Michael S

机构信息

Department of Psychology, New York University, New York, NY, USA.

出版信息

J Vis. 2014 Mar 13;14(3):18. doi: 10.1167/14.3.18.

DOI:10.1167/14.3.18
PMID:24627458
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3954043/
Abstract

How do we find a target embedded in a scene? Within the framework of signal detection theory, this task is carried out by comparing each region of the scene with a "template," i.e., an internal representation of the search target. Here we ask what form this representation takes when the search target is a complex image with uncertain orientation. We examine three possible representations. The first is the matched filter. Such a representation cannot account for the ease with which humans can find a complex search target that is rotated relative to the template. A second representation attempts to deal with this by estimating the relative orientation of target and match and rotating the intensity-based template. No intensity-based template, however, can account for the ability to easily locate targets that are defined categorically and not in terms of a specific arrangement of pixels. Thus, we define a third template that represents the target in terms of image statistics rather than pixel intensities. Subjects performed a two-alternative, forced-choice search task in which they had to localize an image that matched a previously viewed target. Target images were texture patches. In one condition, match images were the same image as the target and distractors were a different image of the same textured material. In the second condition, the match image was of the same texture as the target (but different pixels) and the distractor was an image of a different texture. Match and distractor stimuli were randomly rotated relative to the target. We compared human performance to pixel-based, pixel-based with rotation, and statistic-based search models. The statistic-based search model was most successful at matching human performance. We conclude that humans use summary statistics to search for complex visual targets.

摘要

我们如何在场景中找到嵌入的目标?在信号检测理论的框架内,这项任务是通过将场景的每个区域与一个“模板”进行比较来完成的,即搜索目标的内部表征。在这里,我们要问当搜索目标是一个方向不确定的复杂图像时,这种表征会采取什么形式。我们研究了三种可能的表征。第一种是匹配滤波器。这样的表征无法解释人类能够轻松找到相对于模板旋转的复杂搜索目标的原因。第二种表征试图通过估计目标与匹配之间的相对方向并旋转基于强度的模板来解决这个问题。然而,没有基于强度的模板能够解释轻松定位按类别定义而非根据特定像素排列定义的目标的能力。因此,我们定义了第三种模板,它根据图像统计信息而非像素强度来表征目标。受试者执行了一项二选一的强制选择搜索任务,在该任务中他们必须定位与先前查看的目标匹配的图像。目标图像是纹理块。在一种情况下,匹配图像与目标相同,干扰项是相同纹理材料的不同图像。在第二种情况下,匹配图像与目标具有相同的纹理(但像素不同),干扰项是不同纹理的图像。匹配和干扰刺激相对于目标随机旋转。我们将人类的表现与基于像素、基于像素并带有旋转以及基于统计的搜索模型进行了比较。基于统计的搜索模型在匹配人类表现方面最为成功。我们得出结论,人类使用概要统计信息来搜索复杂的视觉目标。

相似文献

1
Statistical templates for visual search.视觉搜索的统计模板
J Vis. 2014 Mar 13;14(3):18. doi: 10.1167/14.3.18.
2
Task demands determine the specificity of the search template.任务需求决定了搜索模板的特异性。
Atten Percept Psychophys. 2012 Jan;74(1):124-31. doi: 10.3758/s13414-011-0224-5.
3
A summary statistic representation in peripheral vision explains visual search.周边视觉中的一种汇总统计表示法解释了视觉搜索。
J Vis. 2012 Apr 20;12(4):10.1167/12.4.14 14. doi: 10.1167/12.4.14.
4
Informative cues can slow search: the cost of matching a specific template.信息线索会减慢搜索速度:匹配特定模板的代价。
Atten Percept Psychophys. 2014 Jan;76(1):32-9. doi: 10.3758/s13414-013-0532-z.
5
Activation of new attentional templates for real-world objects in visual search.视觉搜索中真实物体新注意力模板的激活。
J Cogn Neurosci. 2015 May;27(5):902-12. doi: 10.1162/jocn_a_00747. Epub 2014 Oct 16.
6
The Time Course of Target Template Activation Processes during Preparation for Visual Search.视觉搜索准备过程中目标模板激活过程的时间进程。
J Neurosci. 2018 Oct 31;38(44):9527-9538. doi: 10.1523/JNEUROSCI.0409-18.2018. Epub 2018 Sep 21.
7
Probabilistic rejection templates in visual working memory.视觉工作记忆中的概率拒绝模板。
Cognition. 2020 Mar;196:104075. doi: 10.1016/j.cognition.2019.104075. Epub 2019 Dec 14.
8
Matching of visual input to only one item at any one time.在任何时刻,视觉输入仅与一个项目匹配。
Psychol Res. 2009 May;73(3):317-26. doi: 10.1007/s00426-008-0157-3. Epub 2008 Jul 30.
9
The specificity of the search template.搜索模板的特异性。
J Vis. 2009 Jan 23;9(1):34.1-9. doi: 10.1167/9.1.34.
10
The footprints of visual attention during search with 100% valid and 100% invalid cues.在使用100%有效线索和100%无效线索进行搜索时视觉注意力的轨迹。
Vision Res. 2004 Jun;44(12):1193-207. doi: 10.1016/j.visres.2003.10.026.

引用本文的文献

1
Higher baseline alpha power is associated with faster responses in visual search.较高的基线阿尔法波功率与视觉搜索中更快的反应相关。
bioRxiv. 2025 Aug 29:2025.08.29.673162. doi: 10.1101/2025.08.29.673162.
2
Visual Search Asymmetry: Deep Nets and Humans Share Similar Inherent Biases.视觉搜索不对称性:深度神经网络与人类具有相似的内在偏差。
Adv Neural Inf Process Syst. 2021 Dec;34:6946-6959.
3
Opposing effects of selectivity and invariance in peripheral vision.外周视觉中选择性和不变性的相反作用。
Nat Commun. 2021 Jul 28;12(1):4597. doi: 10.1038/s41467-021-24880-5.
4
Crowding and attention in a framework of neural network model.神经网络模型框架中的拥挤与注意力
J Vis. 2020 Dec 2;20(13):19. doi: 10.1167/jov.20.13.19.
5
The time-limited visual statistician.有时间限制的视觉统计学家。
J Exp Psychol Hum Percept Perform. 2016 Oct;42(10):1497-504. doi: 10.1037/xhp0000255. Epub 2016 Jun 23.
6
Selectivity and tolerance for visual texture in macaque V2.猕猴V2区对视觉纹理的选择性和耐受性
Proc Natl Acad Sci U S A. 2016 May 31;113(22):E3140-9. doi: 10.1073/pnas.1510847113. Epub 2016 May 12.
7
The perceptual processing capacity of summary statistics between and within feature dimensions.特征维度之间和内部的汇总统计量的感知处理能力。
J Vis. 2015;15(4):9. doi: 10.1167/15.4.9.
8
The capacity limitations of orientation summary statistics.方向汇总统计的容量限制。
Atten Percept Psychophys. 2015 May;77(4):1116-31. doi: 10.3758/s13414-015-0870-0.

本文引用的文献

1
A summary statistic representation in peripheral vision explains visual search.周边视觉中的一种汇总统计表示法解释了视觉搜索。
J Vis. 2012 Apr 20;12(4):10.1167/12.4.14 14. doi: 10.1167/12.4.14.
2
Rethinking the role of top-down attention in vision: effects attributable to a lossy representation in peripheral vision.重新思考自上而下的注意力在视觉中的作用:外周视觉中有损表征所产生的影响。
Front Psychol. 2012 Feb 6;3:13. doi: 10.3389/fpsyg.2012.00013. eCollection 2012.
3
Metamers of the ventral stream.腹侧流的同型物。
Nat Neurosci. 2011 Aug 14;14(9):1195-201. doi: 10.1038/nn.2889.
4
Contributions of ideal observer theory to vision research.理想观察者理论对视觉研究的贡献。
Vision Res. 2011 Apr 13;51(7):771-81. doi: 10.1016/j.visres.2010.09.027. Epub 2010 Nov 9.
5
A summary-statistic representation in peripheral vision explains visual crowding.外周视觉中的汇总统计表示解释了视觉拥挤现象。
J Vis. 2009 Nov 19;9(12):13.1-18. doi: 10.1167/9.12.13.
6
Frequency tuning of perceptual templates changes with noise magnitude.感知模板的频率调谐随噪声强度而变化。
J Opt Soc Am A Opt Image Sci Vis. 2009 Nov;26(11):B72-83. doi: 10.1364/JOSAA.26.000B72.
7
Saliency, attention, and visual search: an information theoretic approach.显著性、注意力与视觉搜索:一种信息论方法。
J Vis. 2009 Mar 13;9(3):5.1-24. doi: 10.1167/9.3.5.
8
Simple summation rule for optimal fixation selection in visual search.视觉搜索中最优注视点选择的简单求和规则。
Vision Res. 2009 Jun;49(10):1286-94. doi: 10.1016/j.visres.2008.12.005. Epub 2009 Jan 10.
9
The uncrowded window of object recognition.物体识别的宽松窗口期
Nat Neurosci. 2008 Oct;11(10):1129-35. doi: 10.1038/nn.2187.
10
Eye movement statistics in humans are consistent with an optimal search strategy.人类的眼球运动统计数据与一种最优搜索策略相一致。
J Vis. 2008 Mar 7;8(3):4.1-14. doi: 10.1167/8.3.4.