• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从全局图像特征估计场景布局属性的感知。

Estimating perception of scene layout properties from global image features.

作者信息

Ross Michael G, Oliva Aude

机构信息

Department of Brain & Cognitive Sciences, Massachusetts Institute of Technology, USA.

出版信息

J Vis. 2010 Jan 8;10(1):2.1-25. doi: 10.1167/10.1.2.

DOI:10.1167/10.1.2
PMID:20143895
Abstract

The relationship between image features and scene structure is central to the study of human visual perception and computer vision, but many of the specifics of real-world layout perception remain unknown. We do not know which image features are relevant to perceiving layout properties, or whether those features provide the same information for every type of image. Furthermore, we do not know the spatial resolutions required for perceiving different properties. This paper describes an experiment and a computational model that provides new insights on these issues. Humans perceive the global spatial layout properties such as dominant depth, openness, and perspective, from a single image. This work describes an algorithm that reliably predicts human layout judgments. This model's predictions are general, not specific to the observers it trained on. Analysis reveals that the optimal spatial resolutions for determining layout vary with the content of the space and the property being estimated. Openness is best estimated at high resolution, depth is best estimated at medium resolution, and perspective is best estimated at low resolution. Given the reliability and simplicity of estimating the global layout of real-world environments, this model could help resolve perceptual ambiguities encountered by more detailed scene reconstruction schemas.

摘要

图像特征与场景结构之间的关系是人类视觉感知和计算机视觉研究的核心,但现实世界布局感知的许多具体细节仍不为人知。我们不知道哪些图像特征与感知布局属性相关,也不知道这些特征是否为每种类型的图像提供相同的信息。此外,我们不知道感知不同属性所需的空间分辨率。本文描述了一项实验和一个计算模型,它们为这些问题提供了新的见解。人类从单张图像中感知诸如主导深度、开放性和透视等全局空间布局属性。这项工作描述了一种可靠预测人类布局判断的算法。该模型的预测具有普遍性,并非特定于其训练所依据的观察者。分析表明,确定布局的最佳空间分辨率会因空间内容和所估计的属性而异。开放性在高分辨率下估计最佳,深度在中等分辨率下估计最佳,透视在低分辨率下估计最佳。鉴于估计现实世界环境全局布局的可靠性和简易性,该模型有助于解决更详细的场景重建模式所遇到的感知模糊性问题。

相似文献

1
Estimating perception of scene layout properties from global image features.从全局图像特征估计场景布局属性的感知。
J Vis. 2010 Jan 8;10(1):2.1-25. doi: 10.1167/10.1.2.
2
Building the gist of a scene: the role of global image features in recognition.构建场景要点:全局图像特征在识别中的作用。
Prog Brain Res. 2006;155:23-36. doi: 10.1016/S0079-6123(06)55002-2.
3
Scene categorization at large visual eccentricities.大视觉离心率下的场景分类
Vision Res. 2013 Jun 28;86:35-42. doi: 10.1016/j.visres.2013.04.006. Epub 2013 Apr 15.
4
Binocular depth perception from unpaired image points need not depend on scene organization.来自非配对图像点的双眼深度感知不一定依赖于场景组织。
Vision Res. 2005 Mar;45(5):527-32. doi: 10.1016/j.visres.2004.09.015.
5
Peripheral stimulation and its effect on perceived spatial scale in virtual environments.外周刺激及其对虚拟环境中感知空间尺度的影响。
IEEE Trans Vis Comput Graph. 2013 Apr;19(4):701-10. doi: 10.1109/TVCG.2013.37.
6
Scene layout priming relies primarily on low-level features rather than scene layout.场景布局启动主要依赖于低级特征而非场景布局。
J Vis. 2019 Jan 2;19(1):14. doi: 10.1167/19.1.14.
7
Sensitivity to luminance and chromaticity gradients in a complex scene.复杂场景中对亮度和色度梯度的敏感度。
J Vis. 2008 Jul 10;8(9):3.1-16. doi: 10.1167/8.9.3.
8
Egocentric depth judgments in optical, see-through augmented reality.光学透明增强现实中的自我中心深度判断
IEEE Trans Vis Comput Graph. 2007 May-Jun;13(3):429-42. doi: 10.1109/TVCG.2007.1035.
9
The influence of motion-defined form on the perception of spatially-defined form.运动定义形式对空间定义形式感知的影响。
Vision Res. 2004 May;44(11):1065-77. doi: 10.1016/j.visres.2004.01.003.
10
Environmental surfaces and the compression of perceived visual space.环境表面与感知视觉空间的压缩
J Vis. 2011 Jun 7;11(7):4. doi: 10.1167/11.7.4.

引用本文的文献

1
Post-Saccadic Disruption of Semantic Category Information in Naturalistic Scenes.自然场景中扫视后语义类别信息的破坏
bioRxiv. 2025 Jun 10:2025.06.06.658316. doi: 10.1101/2025.06.06.658316.
2
Maintaining visual stability in naturalistic scenes: The roles of trans-saccadic memory and default assumptions.在自然场景中保持视觉稳定性:跨眼跳记忆和默认假设的作用。
Cognition. 2025 Sep;262:106165. doi: 10.1016/j.cognition.2025.106165. Epub 2025 May 10.
3
Preliminary Evidence for Global Properties in Human Listeners During Natural Auditory Scene Perception.
人类听众在自然听觉场景感知过程中全局特性的初步证据。
Open Mind (Camb). 2024 Mar 26;8:333-365. doi: 10.1162/opmi_a_00131. eCollection 2024.
4
Memorability-based multimedia analytics for robotic interestingness prediction system using trimmed Q-learning algorithm.基于记忆性的多媒体分析在使用修剪Q学习算法的机器人趣味性预测系统中的应用
Sci Rep. 2023 Nov 13;13(1):19799. doi: 10.1038/s41598-023-44553-1.
5
Contour features predict valence and threat judgements in scenes.轮廓特征可预测场景中的情绪效价和威胁判断。
Sci Rep. 2021 Sep 30;11(1):19405. doi: 10.1038/s41598-021-99044-y.
6
Category systems for real-world scenes.真实场景的分类系统。
J Vis. 2021 Feb 3;21(2):8. doi: 10.1167/jov.21.2.8.
7
Parametric Coding of the Size and Clutter of Natural Scenes in the Human Brain.人脑中自然场景大小与杂乱程度的参数编码
Cereb Cortex. 2015 Jul;25(7):1792-805. doi: 10.1093/cercor/bht418. Epub 2014 Jan 15.
8
Multiple object properties drive scene-selective regions.多个物体属性驱动场景选择区域。
Cereb Cortex. 2014 Apr;24(4):883-97. doi: 10.1093/cercor/bhs364. Epub 2012 Dec 4.
9
Spatially pooled contrast responses predict neural and perceptual similarity of naturalistic image categories.空间池化对比响应可预测自然图像类别的神经和感知相似性。
PLoS Comput Biol. 2012;8(10):e1002726. doi: 10.1371/journal.pcbi.1002726. Epub 2012 Oct 18.
10
Real-world scene representations in high-level visual cortex: it's the spaces more than the places.高级视觉皮层中的真实场景表示:重要的是空间而不是地点。
J Neurosci. 2011 May 18;31(20):7322-33. doi: 10.1523/JNEUROSCI.4588-10.2011.