• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用因子高阶玻尔兹曼机来学习表示空间变换。

Learning to represent spatial transformations with factored higher-order Boltzmann machines.

机构信息

Department of Computer Science, University of Toronto, Toronto M5S 3G4, Canada.

出版信息

Neural Comput. 2010 Jun;22(6):1473-92. doi: 10.1162/neco.2010.01-09-953.

DOI:10.1162/neco.2010.01-09-953
PMID:20141471
Abstract

To allow the hidden units of a restricted Boltzmann machine to model the transformation between two successive images, Memisevic and Hinton (2007) introduced three-way multiplicative interactions that use the intensity of a pixel in the first image as a multiplicative gain on a learned, symmetric weight between a pixel in the second image and a hidden unit. This creates cubically many parameters, which form a three-dimensional interaction tensor. We describe a low-rank approximation to this interaction tensor that uses a sum of factors, each of which is a three-way outer product. This approximation allows efficient learning of transformations between larger image patches. Since each factor can be viewed as an image filter, the model as a whole learns optimal filter pairs for efficiently representing transformations. We demonstrate the learning of optimal filter pairs from various synthetic and real image sequences. We also show how learning about image transformations allows the model to perform a simple visual analogy task, and we show how a completely unsupervised network trained on transformations perceives multiple motions of transparent dot patterns in the same way as humans.

摘要

为了让受限玻尔兹曼机的隐藏单元能够模拟两幅连续图像之间的变换,Memisevic 和 Hinton(2007 年)引入了三向乘法交互作用,该作用使用第一幅图像中某个像素的强度作为第二幅图像中某个像素与隐藏单元之间的已学习对称权重的乘法增益。这会产生三次方数量的参数,这些参数构成了三维交互张量。我们描述了这种交互张量的低秩逼近方法,该方法使用因子之和,每个因子都是三向外积。这种逼近方法允许在更大的图像块之间进行高效的变换学习。由于每个因子都可以看作是图像滤波器,因此整个模型学习最佳滤波器对以有效表示变换。我们展示了从各种合成和真实图像序列中学习最佳滤波器对的方法。我们还展示了学习图像变换如何使模型能够执行简单的视觉类比任务,并且我们展示了在变换上进行完全无监督训练的网络如何以与人类相同的方式感知透明点模式的多种运动。

相似文献

1
Learning to represent spatial transformations with factored higher-order Boltzmann machines.用因子高阶玻尔兹曼机来学习表示空间变换。
Neural Comput. 2010 Jun;22(6):1473-92. doi: 10.1162/neco.2010.01-09-953.
2
Bilinear sparse coding for invariant vision.用于不变视觉的双线性稀疏编码
Neural Comput. 2005 Jan;17(1):47-73. doi: 10.1162/0899766052530893.
3
Convolutional networks can learn to generate affinity graphs for image segmentation.卷积网络可以学习生成图像分割的亲和图。
Neural Comput. 2010 Feb;22(2):511-38. doi: 10.1162/neco.2009.10-08-881.
4
Adaptive two-pass median filter based on support vector machines for image restoration.基于支持向量机的自适应两遍中值滤波器用于图像恢复
Neural Comput. 2004 Feb;16(2):332-53. doi: 10.1162/089976604322742056.
5
Learning the Lie groups of visual invariance.学习视觉不变性的李群。
Neural Comput. 2007 Oct;19(10):2665-93. doi: 10.1162/neco.2007.19.10.2665.
6
Learning viewpoint invariant perceptual representations from cluttered images.从杂乱图像中学习视角不变的感知表征。
IEEE Trans Pattern Anal Mach Intell. 2005 May;27(5):753-61. doi: 10.1109/TPAMI.2005.105.
7
Representational power of restricted boltzmann machines and deep belief networks.受限玻尔兹曼机和深度信念网络的表征能力。
Neural Comput. 2008 Jun;20(6):1631-49. doi: 10.1162/neco.2008.04-07-510.
8
Learning representations for object classification using multi-stage optimal component analysis.使用多阶段最优成分分析学习用于对象分类的表示。
Neural Netw. 2008 Mar-Apr;21(2-3):214-21. doi: 10.1016/j.neunet.2007.12.023. Epub 2007 Dec 28.
9
Determining the receptive field of a neural filter.确定神经滤波器的感受野。
J Neural Eng. 2004 Dec;1(4):228-37. doi: 10.1088/1741-2560/1/4/006. Epub 2004 Dec 2.
10
Incremental nonlinear dimensionality reduction by manifold learning.基于流形学习的增量非线性降维
IEEE Trans Pattern Anal Mach Intell. 2006 Mar;28(3):377-91. doi: 10.1109/TPAMI.2006.56.

引用本文的文献

1
Sustainable Solutions for Sea Monitoring With Robotic Sailboats: N-Boat and F-Boat Twins.用于海洋监测的可持续解决方案:N型船和F型船双体机器人帆船
Front Robot AI. 2022 Apr 5;9:788212. doi: 10.3389/frobt.2022.788212. eCollection 2022.
2
Deep Learning: A Comprehensive Overview on Techniques, Taxonomy, Applications and Research Directions.深度学习:关于技术、分类法、应用及研究方向的全面综述
SN Comput Sci. 2021;2(6):420. doi: 10.1007/s42979-021-00815-1. Epub 2021 Aug 18.
3
Gum-Net: Unsupervised Geometric Matching for Fast and Accurate 3D Subtomogram Image Alignment and Averaging.
Gum-Net:用于快速准确的三维亚断层图像对齐与平均的无监督几何匹配
Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit. 2020 Jun;2020:4072-4082. doi: 10.1109/cvpr42600.2020.00413. Epub 2020 Aug 5.
4
BBS Posts Time Series Analysis based on Sample Entropy and Deep Neural Networks.基于样本熵和深度神经网络的论坛帖子时间序列分析
Entropy (Basel). 2019 Jan 12;21(1):57. doi: 10.3390/e21010057.
5
AI, visual imagery, and a case study on the challenges posed by human intelligence tests.人工智能、视觉意象,以及对人类智力测验所带来的挑战的案例研究。
Proc Natl Acad Sci U S A. 2020 Nov 24;117(47):29390-29397. doi: 10.1073/pnas.1912335117.
6
An intelligent platform for ultrasound diagnosis of thyroid nodules.甲状腺结节超声诊断智能平台。
Sci Rep. 2020 Aug 6;10(1):13223. doi: 10.1038/s41598-020-70159-y.
7
Brain-Inspired Coding of Robot Body Schema Through Visuo-Motor Integration of Touched Events.通过触摸事件的视觉-运动整合对机器人身体图式进行脑启发式编码
Front Neurorobot. 2019 Mar 7;13:5. doi: 10.3389/fnbot.2019.00005. eCollection 2019.
8
An Embodied Multi-Sensor Fusion Approach to Visual Motion Estimation Using Unsupervised Deep Networks.基于无监督深度学习的多传感器融合视觉运动估计方法
Sensors (Basel). 2018 May 4;18(5):1427. doi: 10.3390/s18051427.
9
The visual system's internal model of the world.视觉系统对世界的内部模型。
Proc IEEE Inst Electr Electron Eng. 2015 Aug;103(8):1359-1378. doi: 10.1109/JPROC.2015.2434601. Epub 2015 Jul 6.