• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用感知分组约束对多模态场景表示进行歧义消解。

Disambiguating multi-modal scene representations using perceptual grouping constraints.

机构信息

Centre for Vision, Speech and Signal Processing, University of Surrey, Guildford, UK.

出版信息

PLoS One. 2010 Jun 9;5(6):e10663. doi: 10.1371/journal.pone.0010663.

DOI:10.1371/journal.pone.0010663
PMID:20544006
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2882939/
Abstract

In its early stages, the visual system suffers from a lot of ambiguity and noise that severely limits the performance of early vision algorithms. This article presents feedback mechanisms between early visual processes, such as perceptual grouping, stereopsis and depth reconstruction, that allow the system to reduce this ambiguity and improve early representation of visual information. In the first part, the article proposes a local perceptual grouping algorithm that - in addition to commonly used geometric information - makes use of a novel multi-modal measure between local edge/line features. The grouping information is then used to: 1) disambiguate stereopsis by enforcing that stereo matches preserve groups; and 2) correct the reconstruction error due to the image pixel sampling using a linear interpolation over the groups. The integration of mutual feedback between early vision processes is shown to reduce considerably ambiguity and noise without the need for global constraints.

摘要

在早期阶段,视觉系统受到大量的模糊性和噪声的影响,这严重限制了早期视觉算法的性能。本文提出了早期视觉过程之间的反馈机制,例如知觉分组、立体视和深度重建,这些机制允许系统减少这种模糊性并改善视觉信息的早期表示。在第一部分,本文提出了一种局部知觉分组算法,该算法除了常用的几何信息外,还利用了局部边缘/线特征之间的一种新颖的多模态度量。然后,分组信息用于:1)通过强制立体匹配保持分组来消除立体视的歧义;2)使用组上的线性插值来纠正由于图像像素采样而导致的重建误差。早期视觉过程之间的相互反馈的集成被证明可以在不需要全局约束的情况下大大减少模糊性和噪声。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/5f07263e59d9/pone.0010663.g019.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/1e00db6263f6/pone.0010663.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/affcdd478abd/pone.0010663.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/462b8355c5d6/pone.0010663.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/1ee94474368e/pone.0010663.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/cda7a930fbf7/pone.0010663.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/1ac68f1b7a81/pone.0010663.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/0c622b9496b1/pone.0010663.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/c5d1f57521d6/pone.0010663.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/eb82207b879f/pone.0010663.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/fc6e42d1820e/pone.0010663.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/e6caf8d61a9f/pone.0010663.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/5975523d5c29/pone.0010663.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/42c355a3e892/pone.0010663.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/53d0cd44ba6c/pone.0010663.g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/7c49aa2ba045/pone.0010663.g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/0860b3aaaa43/pone.0010663.g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/c5ea5b3fa6b1/pone.0010663.g017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/a167fd8d575e/pone.0010663.g018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/5f07263e59d9/pone.0010663.g019.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/1e00db6263f6/pone.0010663.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/affcdd478abd/pone.0010663.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/462b8355c5d6/pone.0010663.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/1ee94474368e/pone.0010663.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/cda7a930fbf7/pone.0010663.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/1ac68f1b7a81/pone.0010663.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/0c622b9496b1/pone.0010663.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/c5d1f57521d6/pone.0010663.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/eb82207b879f/pone.0010663.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/fc6e42d1820e/pone.0010663.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/e6caf8d61a9f/pone.0010663.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/5975523d5c29/pone.0010663.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/42c355a3e892/pone.0010663.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/53d0cd44ba6c/pone.0010663.g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/7c49aa2ba045/pone.0010663.g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/0860b3aaaa43/pone.0010663.g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/c5ea5b3fa6b1/pone.0010663.g017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/a167fd8d575e/pone.0010663.g018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e9/2882939/5f07263e59d9/pone.0010663.g019.jpg

相似文献

1
Disambiguating multi-modal scene representations using perceptual grouping constraints.使用感知分组约束对多模态场景表示进行歧义消解。
PLoS One. 2010 Jun 9;5(6):e10663. doi: 10.1371/journal.pone.0010663.
2
When does visual perceptual grouping affect multisensory integration?视觉感知分组何时会影响多感官整合?
Cogn Affect Behav Neurosci. 2004 Jun;4(2):218-29. doi: 10.3758/cabn.4.2.218.
3
Time course of perceptual grouping by color.基于颜色的知觉分组的时间进程。
Psychol Sci. 2003 Jan;14(1):26-30. doi: 10.1111/1467-9280.01414.
4
Cortical algorithms for perceptual grouping.用于感知分组的皮层算法。
Annu Rev Neurosci. 2006;29:203-27. doi: 10.1146/annurev.neuro.29.051605.112939.
5
Opposite modulation of high- and low-level visual aftereffects by perceptual grouping.知觉组织对高低水平视觉后效的相反调制。
Curr Biol. 2012 Jun 5;22(11):1040-5. doi: 10.1016/j.cub.2012.04.026. Epub 2012 May 10.
6
The role of global cues in the perceptual grouping of natural shapes.全局线索在自然形状感知分组中的作用。
J Vis. 2018 Nov 1;18(12):14. doi: 10.1167/18.12.14.
7
A laminar cortical model of stereopsis and 3D surface perception: closure and da Vinci stereopsis.立体视觉和三维表面感知的层状皮质模型:闭合与达·芬奇立体视觉。
Spat Vis. 2005;18(5):515-78. doi: 10.1163/156856805774406756.
8
Stereopsis and 3D surface perception by spiking neurons in laminar cortical circuits: a method for converting neural rate models into spiking models.层状皮质电路中尖峰神经元的立体视和 3D 表面感知:将神经率模型转换为尖峰模型的方法。
Neural Netw. 2012 Feb;26:75-98. doi: 10.1016/j.neunet.2011.10.010. Epub 2011 Nov 4.
9
Visual information processing: the structure and creation of visual representations.视觉信息处理:视觉表征的结构与形成
Philos Trans R Soc Lond B Biol Sci. 1980 Jul 8;290(1038):199-218. doi: 10.1098/rstb.1980.0091.
10
Visual training improves perceptual grouping based on basic stimulus features.视觉训练可改善基于基本刺激特征的知觉分组。
Atten Percept Psychophys. 2017 Oct;79(7):2098-2107. doi: 10.3758/s13414-017-1368-8.

引用本文的文献

1
Accurate and Fast Convergent Initial-Value Belief Propagation for Stereo Matching.用于立体匹配的精确快速收敛初始值置信传播算法
PLoS One. 2015 Sep 8;10(9):e0137530. doi: 10.1371/journal.pone.0137530. eCollection 2015.
2
Luminance, Colour, Viewpoint and Border Enhanced Disparity Energy Model.亮度、颜色、视角和边界增强视差能量模型
PLoS One. 2015 Jun 24;10(6):e0129908. doi: 10.1371/journal.pone.0129908. eCollection 2015.

本文引用的文献

1
Stereo by intra- and inter-scanline search using dynamic programming.使用动态规划进行行间和行间搜索的立体匹配。
IEEE Trans Pattern Anal Mach Intell. 1985 Feb;7(2):139-54. doi: 10.1109/tpami.1985.4767639.
2
Multi-scale lines and edges in V1 and beyond: brightness, object categorization and recognition, and consciousness.初级视觉皮层及其他区域的多尺度线条与边缘:亮度、物体分类与识别以及意识。
Biosystems. 2009 Mar;95(3):206-26. doi: 10.1016/j.biosystems.2008.10.006. Epub 2008 Nov 5.
3
Multi-scale keypoints in V1 and beyond: object segregation, scale selection, saliency maps and face detection.
初级视觉皮层及更高级区域中的多尺度关键点:物体分离、尺度选择、显著性图与面部检测。
Biosystems. 2006 Oct-Dec;86(1-3):75-90. doi: 10.1016/j.biosystems.2006.02.019. Epub 2006 Apr 7.
4
Symbols as self-emergent entities in an optimization process of feature extraction and predictions.在特征提取和预测的优化过程中作为自涌现实体的符号。
Biol Cybern. 2006 Apr;94(4):325-34. doi: 10.1007/s00422-006-0050-3. Epub 2006 Feb 23.
5
Performance evaluation of local descriptors.局部描述符的性能评估
IEEE Trans Pattern Anal Mach Intell. 2005 Oct;27(10):1615-30. doi: 10.1109/TPAMI.2005.188.
6
Ecological cue-validity of proximity and of other Gestalt factors.接近性及其他格式塔因素的生态线索有效性。
Am J Psychol. 1953 Jan;66(1):20-32.
7
Ecological statistics of Gestalt laws for the perceptual organization of contours.用于轮廓感知组织的格式塔法则的生态统计学
J Vis. 2002;2(4):324-53. doi: 10.1167/2.4.5.
8
Edge co-occurrence in natural images predicts contour grouping performance.自然图像中的边缘共现可预测轮廓分组性能。
Vision Res. 2001 Mar;41(6):711-24. doi: 10.1016/s0042-6989(00)00277-7.
9
Contour integration by the human visual system: evidence for a local "association field".人类视觉系统的轮廓整合:局部“关联场”的证据。
Vision Res. 1993 Jan;33(2):173-93. doi: 10.1016/0042-6989(93)90156-q.
10
Representation of local geometry in the visual system.视觉系统中局部几何形状的表征。
Biol Cybern. 1987;55(6):367-75. doi: 10.1007/BF00318371.