Scott Cheng-Hsin Yang, Máté Lengyel, Daniel M Wolpert
Computational and Biological Learning Lab, Department of Engineering, University of Cambridge, Cambridge, United Kingdom.
Department of Cognitive Science, Central European University, Budapest, Hungary.
eLife. 2016 Feb 10;5:e12215. doi: 10.7554/eLife.12215.
Interpreting visual scenes typically requires us to accumulate information from multiple locations in a scene. Using a novel gaze-contingent paradigm in a visual categorization task, we show that participants' scan paths follow an active sensing strategy that incorporates information already acquired about the scene and knowledge of the statistical structure of patterns. Intriguingly, categorization performance was markedly improved when locations were revealed to participants by an optimal Bayesian active sensor algorithm. By using a combination of a Bayesian ideal observer and the active sensor algorithm, we estimate that a major portion of this apparent suboptimality of fixation locations arises from prior biases, perceptual noise, and inaccuracies in eye movements, and that the central process of selecting fixation locations is around 70% efficient in our task. Our results suggest that participants select eye movements with the goal of maximizing information about abstract categories that require the integration of information from multiple locations.
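The core idea of a Bayesian active sensor, as described in the abstract, can be illustrated with a toy sketch: maintain a posterior over categories, and at each step fixate the location whose observation is expected to reduce posterior entropy the most (greedy expected information gain). This is not the authors' implementation; it is a minimal illustration assuming binary observations and a known per-category likelihood map (`likelihoods[c, loc]` gives the probability of observing a 1 at `loc` under category `c`).

```python
import numpy as np

def entropy(p):
    """Shannon entropy (bits) of a discrete distribution."""
    p = np.clip(p, 1e-12, 1.0)
    return -np.sum(p * np.log2(p))

def expected_info_gain(posterior, likelihoods, loc):
    """Expected reduction in category-posterior entropy from
    observing the (binary) pattern value at location `loc`."""
    eig = entropy(posterior)
    for obs in (0, 1):
        p_obs_given_c = likelihoods[:, loc] if obs == 1 else 1.0 - likelihoods[:, loc]
        p_obs = np.sum(posterior * p_obs_given_c)       # predictive probability of obs
        if p_obs > 0:
            post_new = posterior * p_obs_given_c / p_obs  # Bayes update
            eig -= p_obs * entropy(post_new)              # expected remaining entropy
    return eig

def select_fixation(posterior, likelihoods, visited):
    """Greedily pick the unvisited location with maximal expected info gain."""
    gains = [expected_info_gain(posterior, likelihoods, loc) if loc not in visited
             else -np.inf
             for loc in range(likelihoods.shape[1])]
    return int(np.argmax(gains))

# Toy example: two categories that differ only at location 1,
# so an information-maximizing sensor should fixate location 1 first.
likelihoods = np.array([[0.5, 0.9, 0.5],
                        [0.5, 0.1, 0.5]])
posterior = np.array([0.5, 0.5])
print(select_fixation(posterior, likelihoods, visited=set()))  # → 1
```

In the paper's setting the observations are noisy pattern fragments rather than clean binary values, and the reported ~70% efficiency was estimated by comparing human fixation choices against this kind of information-maximizing benchmark after accounting for perceptual noise, prior biases, and motor inaccuracy.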