What has been missed for predicting human attention in viewing driving clips?

Authors

Xu Jiawei, Yue Shigang, Menchinelli Federica, Guo Kun

Affiliations

School of Computer Science, University of Lincoln, Lincoln, United Kingdom.

School of Psychology, University of Lincoln, Lincoln, United Kingdom.

Publication information

PeerJ. 2017 Feb 1;5:e2946. doi: 10.7717/peerj.2946. eCollection 2017.

DOI:10.7717/peerj.2946

PMID:28168112

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5291110/

Abstract

Recent research progress on the topic of human visual attention allocation in scene perception and its simulation is based mainly on studies with static images. However, natural vision requires us to extract visual information that constantly changes due to egocentric movements or dynamics of the world. It is unclear to what extent spatio-temporal regularity, an inherent regularity in dynamic vision, affects human gaze distribution and saliency computation in visual attention models. In this free-viewing eye-tracking study we manipulated the spatio-temporal regularity of traffic videos by presenting them in normal video sequence, reversed video sequence, normal frame sequence, and randomised frame sequence. The recorded human gaze allocation was then used as the 'ground truth' to examine the predictive ability of a number of state-of-the-art visual attention models. The analysis revealed high inter-observer agreement across individual human observers, but all the tested attention models performed significantly worse than humans. The inferior predictability of the models was evident from indistinguishable gaze prediction irrespective of stimuli presentation sequence, and weak central fixation bias. Our findings suggest that a realistic visual attention model for the processing of dynamic scenes should incorporate human visual sensitivity with spatio-temporal regularity and central fixation bias.
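The frame-order manipulation described in the abstract can be illustrated with a minimal sketch. It covers only the ordering of frames (normal, reversed, randomised); the abstract does not say how the "video sequence" and "frame sequence" conditions differ in presentation, so that distinction is not modelled here. The function name `presentation_orders` and its `seed` parameter are hypothetical and are not taken from the study.

```python
import random


def presentation_orders(n_frames, seed=0):
    """Return frame-index orderings for a clip with n_frames frames.

    Illustrative only: this is an assumption about how the orderings
    could be generated, not the authors' stimulus-preparation code.
    """
    normal = list(range(n_frames))      # frames in recorded order
    reversed_order = normal[::-1]       # same frames, played backwards
    randomised = normal[:]              # copy, then shuffle in place
    random.Random(seed).shuffle(randomised)
    return {
        "normal": normal,
        "reversed": reversed_order,
        "randomised": randomised,
    }


if __name__ == "__main__":
    for name, order in presentation_orders(8, seed=1).items():
        print(name, order)
```

Reversal preserves the frame-to-frame continuity of the clip while inverting its temporal direction, whereas shuffling destroys continuity altogether. This is one way to read the abstract's contrast between sequences with and without spatio-temporal regularity.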

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bcd6/5291110/48bbb099466e/peerj-05-2946-g001.jpg

Similar articles

How does image noise affect actual and predicted human gaze allocation in assessing image quality?

Vision Res. 2015 Jul;112:11-25. doi: 10.1016/j.visres.2015.03.029. Epub 2015 May 14.

Attentional synchrony and the influence of viewing task on gaze behavior in static and dynamic scenes.

J Vis. 2013 Jul 17;13(8):16. doi: 10.1167/13.8.16.

Complementary effects of gaze direction and early saliency in guiding fixations during free viewing.

J Vis. 2014 Nov 4;14(13):3. doi: 10.1167/14.13.3.

Quantifying center bias of observers in free viewing of dynamic natural scenes.

J Vis. 2009 Jul 9;9(7):4. doi: 10.1167/9.7.4.

Free viewing of dynamic stimuli by humans and monkeys.

J Vis. 2009 May 19;9(5):19.1-15. doi: 10.1167/9.5.19.

Predicting visual fixations on video based on low-level visual features.

Vision Res. 2007 Sep;47(19):2483-98. doi: 10.1016/j.visres.2007.06.015. Epub 2007 Aug 3.

What stands out in a scene? A study of human explicit saliency judgment.

Vision Res. 2013 Oct 18;91:62-77. doi: 10.1016/j.visres.2013.07.016. Epub 2013 Aug 15.

How saliency, faces, and sound influence gaze in dynamic social scenes.

J Vis. 2014 Jul 3;14(8):5. doi: 10.1167/14.8.5.

Gaze distribution analysis and saliency prediction across age groups.

PLoS One. 2018 Feb 23;13(2):e0193149. doi: 10.1371/journal.pone.0193149. eCollection 2018.
