• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

图像描述在视觉初始阶段的效用:以印刷文本为例的案例研究。

The utility of image descriptions in the initial stages of vision: a case study of printed text.

机构信息

Department of Psychology, University of Stirling, Scotland, UK.

出版信息

Br J Psychol. 2010 Feb;101(Pt 1):1-26. doi: 10.1348/000712608X379070. Epub 2009 Feb 14.

DOI:10.1348/000712608X379070
PMID:19220935
Abstract

Vision research has made very substantial progress towards understanding how we see. It is one area of psychology where the three-way thrust of behavioural measurements (psychophysics), brain imaging, and computational studies have been combined quite routinely for some years. The purpose of this paper is to demonstrate a relatively unusual form of computational modelling that we characterise as involving image descriptions. Image descriptions are statements about structures in images and relationships between structures. Most modelling in vision is either conceived in fairly abstract terms, or is done at the level of images. Neither is entirely satisfactory, and image descriptions are a simple formulation of age-old ideas about a Vocabulary of image features that are detected and parameterized from actual digital images. For our example, we use the domain of the visual perception of printed text. This is an area that has been characterized by thorough, robust psychophysical experiments. The fundamental requirements of visual processing in this domain are: grouping of some parts if the image into words; at the same time segmenting words from each other. We show how these are readily understood in terms of our model of image descriptions, and show quantitatively that typographical practice, refined over centuries, is about optimum for the visual system at least as represented by our model. In addition, we show that the same notion of image descriptions could, in principle, support word recognition in certain circumstances.

摘要

视觉研究在理解我们如何看方面已经取得了非常大的进展。它是心理学的一个领域,行为测量(心理物理学)、大脑成像和计算研究的三管齐下已经结合了好几年。本文的目的是展示一种相对不常见的计算建模形式,我们称之为涉及图像描述。图像描述是关于图像中的结构和结构之间关系的陈述。大多数视觉建模要么是在相当抽象的术语中构想的,要么是在图像层面上进行的。两者都不完全令人满意,而图像描述是对从实际数字图像中检测和参数化的图像特征词汇的古老思想的简单表述。对于我们的例子,我们使用印刷文本视觉感知的领域。这是一个经过彻底、稳健的心理物理实验所描述的领域。该领域视觉处理的基本要求是:将图像的某些部分组合成单词;同时将单词彼此分割。我们展示了如何根据我们的图像描述模型来理解这些要求,并且定量地表明,经过数百年的精炼,印刷实践对于我们的模型所代表的视觉系统至少是最优的。此外,我们还表明,在某些情况下,相同的图像描述概念可以支持单词识别。

相似文献

1
The utility of image descriptions in the initial stages of vision: a case study of printed text.图像描述在视觉初始阶段的效用:以印刷文本为例的案例研究。
Br J Psychol. 2010 Feb;101(Pt 1):1-26. doi: 10.1348/000712608X379070. Epub 2009 Feb 14.
2
Image descriptions in early and mid-level vision: what kind of model is this and what kind of models do we really need?早期和中期视觉中的图像描述:这是哪种模型,我们真正需要哪种模型?
Br J Psychol. 2010 Feb;101(Pt 1):27-32; author reply 41-6. doi: 10.1348/000712609X458053. Epub 2009 Jul 3.
3
Do image descriptions underlie word recognition in reading?图像描述是否是阅读中单词识别的基础?
Br J Psychol. 2010 Feb;101(Pt 1):33-9; author reply 41-6. doi: 10.1348/000712609X474730. Epub 2009 Oct 23.
4
Font adaptive word indexing of modern printed documents.现代印刷文档的字体自适应词索引
IEEE Trans Pattern Anal Mach Intell. 2006 Aug;28(8):1187-99. doi: 10.1109/TPAMI.2006.162.
5
The improbability of harris interest points.哈里斯兴趣点的不可能性。
IEEE Trans Pattern Anal Mach Intell. 2010 Jun;32(6):1141-7. doi: 10.1109/TPAMI.2010.53.
6
What is required for a signal to be qualified as a 'grouping' tag?一个信号需要满足什么条件才能被视为“分组”标签?
Br J Psychol. 2011 Aug;102(3):676-81; author reply 682-3. doi: 10.1111/j.2044-8295.2011.02022.x. Epub 2011 Apr 19.
7
Reduced-reference IQA in contourlet domain.轮廓波域中的简化参考图像质量评价
IEEE Trans Syst Man Cybern B Cybern. 2009 Dec;39(6):1623-7. doi: 10.1109/TSMCB.2009.2021951. Epub 2009 May 19.
8
Annotating images by mining image search results.通过挖掘图像搜索结果来标注图像。
IEEE Trans Pattern Anal Mach Intell. 2008 Nov;30(11):1919-32. doi: 10.1109/TPAMI.2008.127.
9
Automatic image orientation detection via confidence-based integration of low-level and semantic cues.通过基于置信度的低级线索与语义线索整合实现自动图像方向检测
IEEE Trans Pattern Anal Mach Intell. 2005 May;27(5):715-26. doi: 10.1109/TPAMI.2005.96.
10
On the role of medial geometry in human vision.论内侧几何学在人类视觉中的作用。
J Physiol Paris. 2003 Mar-May;97(2-3):155-90. doi: 10.1016/j.jphysparis.2003.09.003.

引用本文的文献

1
Locating the cortical bottleneck for slow reading in peripheral vision.定位周边视觉中慢速阅读的皮质瓶颈。
J Vis. 2015 Aug 1;15(11):3. doi: 10.1167/15.11.3.
2
Differences between Old and Young Adults' Ability to Recognize Human Faces Underlie Processing of Horizontal Information.老年人与年轻人识别人脸能力的差异是水平信息处理的基础。
Front Aging Neurosci. 2012 Apr 23;4:3. doi: 10.3389/fnagi.2012.00003. eCollection 2012.
3
Crowding follows the binding of relative position and orientation.拥挤现象遵循相对位置和方向的绑定。
J Vis. 2012 Mar 21;12(3):10.1167/12.3.18 18. doi: 10.1167/12.3.18.
4
The mechanism of word crowding.文字拥挤的机制。
Vision Res. 2012 Jan 1;52(1):61-9. doi: 10.1016/j.visres.2011.10.015. Epub 2011 Nov 7.
5
Horizontal information drives the behavioral signatures of face processing.水平信息驱动面部加工的行为特征。
Front Psychol. 2010 Sep 28;1:143. doi: 10.3389/fpsyg.2010.00143. eCollection 2010.
6
Do image descriptions underlie word recognition in reading?图像描述是否是阅读中单词识别的基础?
Br J Psychol. 2010 Feb;101(Pt 1):33-9; author reply 41-6. doi: 10.1348/000712609X474730. Epub 2009 Oct 23.