• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从杂乱图像中学习视角不变的感知表征。

Learning viewpoint invariant perceptual representations from cluttered images.

作者信息

Spratling Michael W

机构信息

Division of Engineering, King's College, London, UK.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2005 May;27(5):753-61. doi: 10.1109/TPAMI.2005.105.

DOI:10.1109/TPAMI.2005.105
PMID:15875796
Abstract

In order to perform object recognition, it is necessary to form perceptual representations that are sufficiently specific to distinguish between objects, but that are also sufficiently flexible to generalize across changes in location, rotation, and scale. A standard method for learning perceptual representations that are invariant to viewpoint is to form temporal associations across image sequences showing object transformations. However, this method requires that individual stimuli be presented in isolation and is therefore unlikely to succeed in real-world applications where multiple objects can co-occur in the visual input. This paper proposes a simple modification to the learning method that can overcome this limitation and results in more robust learning of invariant representations.

摘要

为了执行目标识别,有必要形成足够具体以区分不同目标,但同时也足够灵活以在位置、旋转和比例变化中进行泛化的感知表征。学习对视角不变的感知表征的一种标准方法是在显示目标变换的图像序列中形成时间关联。然而,这种方法要求单个刺激单独呈现,因此在视觉输入中可能同时出现多个目标的现实世界应用中不太可能成功。本文提出了一种对学习方法的简单修改,它可以克服这一限制,并导致对不变表征进行更稳健的学习。

相似文献

1
Learning viewpoint invariant perceptual representations from cluttered images.从杂乱图像中学习视角不变的感知表征。
IEEE Trans Pattern Anal Mach Intell. 2005 May;27(5):753-61. doi: 10.1109/TPAMI.2005.105.
2
Affine invariant pattern recognition using Multiscale Autoconvolution.使用多尺度自卷积的仿射不变模式识别
IEEE Trans Pattern Anal Mach Intell. 2005 Jun;27(6):908-18. doi: 10.1109/TPAMI.2005.111.
3
Optimal linear representations of images for object recognition.用于目标识别的图像最优线性表示。
IEEE Trans Pattern Anal Mach Intell. 2004 May;26(5):662-6. doi: 10.1109/TPAMI.2004.1273986.
4
Sparse representation for coarse and fine object recognition.用于粗略和精细目标识别的稀疏表示。
IEEE Trans Pattern Anal Mach Intell. 2006 Apr;28(4):555-67. doi: 10.1109/TPAMI.2006.84.
5
Clutter invariant ATR.杂波不变自动目标识别
IEEE Trans Pattern Anal Mach Intell. 2005 May;27(5):817-21. doi: 10.1109/TPAMI.2005.97.
6
Symmetric image registration.对称图像配准
Med Image Anal. 2006 Jun;10(3):484-93. doi: 10.1016/j.media.2005.03.003.
7
Three-dimensional model-based object recognition and segmentation in cluttered scenes.基于三维模型的杂乱场景中的目标识别与分割
IEEE Trans Pattern Anal Mach Intell. 2006 Oct;28(10):1584-601. doi: 10.1109/TPAMI.2006.213.
8
A new image representation algorithm inspired by image submodality models, redundancy reduction, and learning in biological vision.一种受图像子模态模型、冗余减少以及生物视觉学习启发的新图像表示算法。
IEEE Trans Pattern Anal Mach Intell. 2005 Sep;27(9):1367-78. doi: 10.1109/TPAMI.2005.170.
9
On the removal of shadows from images.关于从图像中去除阴影。
IEEE Trans Pattern Anal Mach Intell. 2006 Jan;28(1):59-68. doi: 10.1109/TPAMI.2006.18.
10
Robust pose estimation and recognition using non-gaussian modeling of appearance subspaces.使用外观子空间的非高斯建模进行鲁棒姿态估计与识别。
IEEE Trans Pattern Anal Mach Intell. 2007 May;29(5):901-5. doi: 10.1109/TPAMI.2007.1028.

引用本文的文献

1
A Hierarchical Predictive Coding Model of Object Recognition in Natural Images.自然图像中物体识别的分层预测编码模型。
Cognit Comput. 2017;9(2):151-167. doi: 10.1007/s12559-016-9445-1. Epub 2016 Dec 28.
2
Unsupervised invariance learning of transformation sequences in a model of object recognition yields selectivity for non-accidental properties.在物体识别模型中,对变换序列进行无监督不变性学习可产生对非偶然属性的选择性。
Front Comput Neurosci. 2015 Oct 7;9:115. doi: 10.3389/fncom.2015.00115. eCollection 2015.
3
The Invariance Hypothesis Implies Domain-Specific Regions in Visual Cortex.
不变性假说意味着视觉皮层中存在特定领域的区域。
PLoS Comput Biol. 2015 Oct 23;11(10):e1004390. doi: 10.1371/journal.pcbi.1004390. eCollection 2015 Oct.
4
STDP in lateral connections creates category-based perceptual cycles for invariance learning with multiple stimuli.侧支连接中的突触时间依赖性可塑性为多种刺激的不变性学习创建基于类别的感知循环。
Biol Cybern. 2015 Apr;109(2):215-39. doi: 10.1007/s00422-014-0637-z. Epub 2014 Dec 9.
5
Slowness and sparseness have diverging effects on complex cell learning.迟缓与稀疏性对复杂细胞学习具有不同的影响。
PLoS Comput Biol. 2014 Mar 6;10(3):e1003468. doi: 10.1371/journal.pcbi.1003468. eCollection 2014 Mar.
6
How lateral connections and spiking dynamics may separate multiple objects moving together.侧向连接和尖峰动力学如何分离一起运动的多个物体。
PLoS One. 2013 Aug 2;8(8):e69952. doi: 10.1371/journal.pone.0069952. Print 2013.
7
Learning and disrupting invariance in visual recognition with a temporal association rule.利用时间关联规则学习和破坏视觉识别中的不变性。
Front Comput Neurosci. 2012 Jun 25;6:37. doi: 10.3389/fncom.2012.00037. eCollection 2012.
8
Relative spike time coding and STDP-based orientation selectivity in the early visual system in natural continuous and saccadic vision: a computational model.自然连续和扫视视觉中早期视觉系统基于相对尖峰时间编码和基于STDP的方向选择性:一种计算模型
J Comput Neurosci. 2012 Jun;32(3):425-41. doi: 10.1007/s10827-011-0361-9. Epub 2011 Sep 21.