• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

视觉系统中大小和旋转不变模式处理的模型。

A model for size- and rotation-invariant pattern processing in the visual system.

作者信息

Reitboeck H J, Altmann J

出版信息

Biol Cybern. 1984;51(2):113-21. doi: 10.1007/BF00357924.

DOI:10.1007/BF00357924
PMID:6509123
Abstract

The mapping of retinal space onto the striate cortex of some mammals can be approximated by a log-polar function. It has been proposed that this mapping is of functional importance for scale- and rotation-invariant pattern recognition in the visual system. An exact log-polar transform converts centered scaling and rotation into translations. A subsequent translation-invariant transform, such as the absolute value of the Fourier transform, thus generates overall size- and rotation-invariance. In our model, the translation-invariance is realized via the R-transform. This transform can be executed by simple neural networks, and it does not require the complex computations of the Fourier transform, used in Mellin-transform size-invariance models. The logarithmic space distortion and differentiation in the first processing stage of the model is realized via "Mexican hat" filters whose diameter increases linearly with eccentricity, similar to the characteristics of the receptive fields of retinal ganglion cells. Except for some special cases, the model can explain object recognition independent of size, orientation and position. Some general problems of Mellin-type size-invariance models-that also apply to our model-are discussed.

摘要

某些哺乳动物视网膜空间到纹状皮质的映射可以用对数极坐标函数近似表示。有人提出,这种映射对于视觉系统中尺度和旋转不变的模式识别具有重要的功能意义。精确的对数极坐标变换将中心缩放和旋转转换为平移。随后的平移不变变换,如傅里叶变换的绝对值,从而产生整体大小和旋转不变性。在我们的模型中,平移不变性是通过R变换实现的。这种变换可以由简单的神经网络执行,并且不需要梅林变换大小不变性模型中使用的傅里叶变换的复杂计算。模型第一处理阶段的对数空间扭曲和微分是通过“墨西哥帽”滤波器实现的,其直径随偏心率线性增加,类似于视网膜神经节细胞感受野的特征。除了一些特殊情况外,该模型可以解释与大小、方向和位置无关的物体识别。还讨论了梅林型大小不变性模型的一些普遍问题——这些问题也适用于我们的模型。

相似文献

1
A model for size- and rotation-invariant pattern processing in the visual system.视觉系统中大小和旋转不变模式处理的模型。
Biol Cybern. 1984;51(2):113-21. doi: 10.1007/BF00357924.
2
Invariant visual responses from attentional gain fields.来自注意力增益场的不变视觉反应。
J Neurophysiol. 1997 Jun;77(6):3267-72. doi: 10.1152/jn.1997.77.6.3267.
3
Experiments on pattern recognition using invariant Fourier-Mellin descriptors.使用不变傅里叶 - 梅林描述符进行模式识别的实验。
J Opt Soc Am A. 1986 Jun;3(6):771-6. doi: 10.1364/josaa.3.000771.
4
A Fast Correlation Method for Scale-and Translation-Invariant Pattern Recognition.一种用于尺度和平移不变模式识别的快速相关方法。
IEEE Trans Pattern Anal Mach Intell. 1984 Jan;6(1):46-57. doi: 10.1109/tpami.1984.4767474.
5
The representation of perceived angular size in human primary visual cortex.人类初级视觉皮层中感知角大小的表征。
Nat Neurosci. 2006 Mar;9(3):429-34. doi: 10.1038/nn1641. Epub 2006 Feb 5.
6
Invariant visual object recognition: a model, with lighting invariance.不变视觉对象识别:一种具有光照不变性的模型。
J Physiol Paris. 2006 Jul-Sep;100(1-3):43-62. doi: 10.1016/j.jphysparis.2006.09.004. Epub 2006 Oct 30.
7
Relationship between preferred orientation and receptive field position of neurons in extrastriate cortex (area 19) in the cat.猫纹外皮层(19区)神经元的优势取向与感受野位置之间的关系。
J Comp Neurol. 1984 Jan 20;222(3):445-51. doi: 10.1002/cne.902220309.
8
The relationship between log-complex transforms of stimuli and the cortical responses they evoke.刺激的对数复变变换与其所引发的皮层反应之间的关系。
Vision Res. 1986;26(12):1909-23. doi: 10.1016/0042-6989(86)90117-3.
9
Cortical magnification, scale invariance and visual ecology.皮质放大率、尺度不变性与视觉生态学
Vision Res. 1996 Sep;36(18):2971-7. doi: 10.1016/0042-6989(95)00344-4.
10
Visual evoked responses and retinal eccentricity.视觉诱发电位与视网膜偏心度。
Ann N Y Acad Sci. 1982;388:648-50. doi: 10.1111/j.1749-6632.1982.tb50829.x.

引用本文的文献

1
The ripple pond: enabling spiking networks to see.涟漪池:使尖峰网络“看见”。
Front Neurosci. 2013 Nov 15;7:212. doi: 10.3389/fnins.2013.00212. eCollection 2013.
2
Fourier transform magnitudes are unique pattern recognition templates.
Biol Cybern. 1986;54(6):385-91. doi: 10.1007/BF00355544.

本文引用的文献

1
A Fast Correlation Method for Scale-and Translation-Invariant Pattern Recognition.一种用于尺度和平移不变模式识别的快速相关方法。
IEEE Trans Pattern Anal Mach Intell. 1984 Jan;6(1):46-57. doi: 10.1109/tpami.1984.4767474.
2
Position, rotation, and scale invariant optical correlation.位置、旋转和尺度不变光学相关
Appl Opt. 1976 Jul 1;15(7):1795-9. doi: 10.1364/AO.15.001795.
3
The angular selectivity of visual cortical cells to moving gratings.视觉皮层细胞对移动光栅的角度选择性。
J Physiol. 1968 Sep;198(1):237-50. doi: 10.1113/jphysiol.1968.sp008604.
4
PROJECTION OF THE RETINA ON TO STRIATE AND PRESTRIATE CORTEX IN THE SQUIRREL MONKEY, SAIMIRI SCIUREUS.松鼠猴(Saimiri sciureus)视网膜在纹状皮层和纹前皮层上的投射
J Neurophysiol. 1964 May;27:366-93. doi: 10.1152/jn.1964.27.3.366.
5
The representation of the visual field on the cerebral cortex in monkeys.猴子大脑皮层上视野的表征。
J Physiol. 1961 Dec;159(2):203-21. doi: 10.1113/jphysiol.1961.sp006803.
6
Computational anatomy and functional architecture of striate cortex: a spatial mapping approach to perceptual coding.纹状皮层的计算解剖学与功能结构:一种用于感知编码的空间映射方法。
Vision Res. 1980;20(8):645-69. doi: 10.1016/0042-6989(80)90090-5.
7
Size invariance: reply to Schwartz.
Perception. 1981;10(4):469-74. doi: 10.1068/p100469.
8
Cortical anatomy, size invariance, and spatial frequency analysis.
Perception. 1981;10(4):455-68. doi: 10.1068/p100455.
9
Magnification factor and receptive field size in foveal striate cortex of the monkey.猴子中央凹纹状皮层的放大因子和感受野大小。
Exp Brain Res. 1981;44(2):213-28. doi: 10.1007/BF00237343.
10
Functional size invariance is not provided by the cortical magnification factor.
Vision Res. 1982;22(11):1409-12. doi: 10.1016/0042-6989(82)90231-0.