• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一个学习识别三维物体的网络。

A network that learns to recognize three-dimensional objects.

作者信息

Poggio T, Edelman S

机构信息

Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge 02139.

出版信息

Nature. 1990 Jan 18;343(6255):263-6. doi: 10.1038/343263a0.

DOI:10.1038/343263a0
PMID:2300170
Abstract

The visual recognition of three-dimensional (3-D) objects on the basis of their shape poses at least two difficult problems. First, there is the problem of variable illumination, which can be addressed by working with relatively stable features such as intensity edges rather than the raw intensity images. Second, there is the problem of the initially unknown pose of the object relative to the viewer. In one approach to this problem, a hypothesis is first made about the viewpoint, then the appearance of a model object from such a viewpoint is computed and compared with the actual image. Such recognition schemes generally employ 3-D models of objects, but the automatic learning of 3-D models is itself a difficult problem. To address this problem in computational vision, we have developed a scheme, based on the theory of approximation of multivariate functions, that learns from a small set of perspective views a function mapping any viewpoint to a standard view. A network equivalent to this scheme will thus 'recognize' the object on which it was trained from any viewpoint.

摘要

基于三维(3-D)物体的形状进行视觉识别至少存在两个难题。首先是光照变化问题,这可以通过使用相对稳定的特征(如强度边缘)而非原始强度图像来解决。其次是物体相对于观察者的初始姿态未知的问题。针对这个问题的一种方法是,首先对视角做出假设,然后计算从该视角观察到的模型物体的外观,并与实际图像进行比较。此类识别方案通常采用物体的三维模型,但三维模型的自动学习本身就是一个难题。为了解决计算视觉中的这个问题,我们基于多元函数逼近理论开发了一种方案,该方案从一小组透视图中学习一个将任何视角映射到标准视图的函数。因此,与该方案等效的网络将能够从任何视角“识别”其训练所基于的物体。

相似文献

1
A network that learns to recognize three-dimensional objects.一个学习识别三维物体的网络。
Nature. 1990 Jan 18;343(6255):263-6. doi: 10.1038/343263a0.
2
3-D object recognition using 2-D views.使用二维视图进行三维物体识别。
IEEE Trans Image Process. 2008 Nov;17(11):2236-55. doi: 10.1109/TIP.2008.2003404.
3
Learning to recognize three-dimensional objects.学习识别三维物体。
Neural Comput. 2002 May;14(5):1071-103. doi: 10.1162/089976602753633394.
4
Analysis of cluttered scenes using an elastic matching approach for stereo images.使用弹性匹配方法对立体图像的杂乱场景进行分析。
Neural Comput. 2006 Jun;18(6):1441-71. doi: 10.1162/neco.2006.18.6.1441.
5
Face recognition from a single training image under arbitrary unknown lighting using spherical harmonics.利用球谐函数在任意未知光照条件下从单张训练图像进行人脸识别。
IEEE Trans Pattern Anal Mach Intell. 2006 Mar;28(3):351-63. doi: 10.1109/TPAMI.2006.53.
6
Learning to recognize objects on the fly: a neurally based dynamic field approach.动态学习识别物体:基于神经的动态场方法。
Neural Netw. 2008 May;21(4):562-76. doi: 10.1016/j.neunet.2008.03.007. Epub 2008 Apr 27.
7
Low-level aspects of segmentation and recognition.分割与识别的低级方面。
Philos Trans R Soc Lond B Biol Sci. 1992 Sep 29;337(1281):371-8; discussion 379. doi: 10.1098/rstb.1992.0115.
8
Invariant object recognition in the visual system with novel views of 3D objects.视觉系统中具有三维物体新视角的不变物体识别
Neural Comput. 2002 Nov;14(11):2585-96. doi: 10.1162/089976602760407982.
9
View-invariant object category learning, recognition, and search: how spatial and object attention are coordinated using surface-based attentional shrouds.视图不变物体类别学习、识别与搜索:基于表面的注意力罩如何协调空间和物体注意力。
Cogn Psychol. 2009 Feb;58(1):1-48. doi: 10.1016/j.cogpsych.2008.05.001. Epub 2008 Jul 23.
10
Sparse representation for coarse and fine object recognition.用于粗略和精细目标识别的稀疏表示。
IEEE Trans Pattern Anal Mach Intell. 2006 Apr;28(4):555-67. doi: 10.1109/TPAMI.2006.84.

引用本文的文献

1
The Oomplet dataset toolkit as a flexible and extensible system for large-scale, multi-category image generation.Oomplet数据集工具包是一个用于大规模、多类别图像生成的灵活且可扩展的系统。
Sci Rep. 2025 Mar 18;15(1):9287. doi: 10.1038/s41598-025-93036-y.
2
How well do rudimentary plasticity rules predict adult visual object learning?基本的可塑性规则在多大程度上可以预测成人的视觉物体学习?
PLoS Comput Biol. 2023 Dec 11;19(12):e1011713. doi: 10.1371/journal.pcbi.1011713. eCollection 2023 Dec.
3
Viewpoint dependence and scene context effects generalize to depth rotated three-dimensional objects.
视角依赖和场景上下文效应对深度旋转的三维物体具有泛化性。
J Vis. 2023 Sep 1;23(10):9. doi: 10.1167/jov.23.10.9.
4
Macaques recognize features in synthetic images derived from ventral stream neurons.猕猴能够识别出腹侧流神经元衍生的合成图像中的特征。
Proc Natl Acad Sci U S A. 2023 Mar 7;120(10):e2213034120. doi: 10.1073/pnas.2213034120. Epub 2023 Mar 1.
5
Challenging the Classical View: Recognition of Identity and Expression as Integrated Processes.挑战传统观点:将身份识别与表达视为整合过程
Brain Sci. 2023 Feb 10;13(2):296. doi: 10.3390/brainsci13020296.
6
Individual differences in classification images of Mooney faces.穆尼人脸分类图像的个体差异。
J Vis. 2022 Dec 1;22(13):3. doi: 10.1167/jov.22.13.3.
7
Three-dimensional pose discrimination in natural images of humans.人类自然图像中的三维姿态辨别
Cogsci. 2021 Jul;43:223-229.
8
Maximal Dependence Capturing as a Principle of Sensory Processing.将最大依赖性捕获作为感觉处理的一个原则。
Front Comput Neurosci. 2022 Mar 25;16:857653. doi: 10.3389/fncom.2022.857653. eCollection 2022.
9
Using eye-tracking to parse object recognition: Priming activates primarily a parts-based but also a late-emerging features-based representation.利用眼动追踪解析物体识别:启动主要激活基于部分的表征,但也激活后期出现的基于特征的表征。
Atten Percept Psychophys. 2020 Aug;82(6):3096-3111. doi: 10.3758/s13414-020-02040-z.
10
View-Tolerant Face Recognition and Hebbian Learning Imply Mirror-Symmetric Neural Tuning to Head Orientation.视角容忍人脸识别与赫布学习意味着头部朝向的镜像对称神经调节。
Curr Biol. 2017 Jan 9;27(1):62-67. doi: 10.1016/j.cub.2016.10.015. Epub 2016 Dec 1.