• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

学习如何使用深度架构进行图像跟踪。

Learning where to attend with deep architectures for image tracking.

机构信息

University of British Columbia, Vancouver, BC V6G 1Z4, Canada.

出版信息

Neural Comput. 2012 Aug;24(8):2151-84. doi: 10.1162/NECO_a_00312. Epub 2012 Apr 17.

DOI:10.1162/NECO_a_00312
PMID:22509964
Abstract

We discuss an attentional model for simultaneous object tracking and recognition that is driven by gaze data. Motivated by theories of perception, the model consists of two interacting pathways, identity and control, intended to mirror the what and where pathways in neuroscience models. The identity pathway models object appearance and performs classification using deep (factored)-restricted Boltzmann machines. At each point in time, the observations consist of foveated images, with decaying resolution toward the periphery of the gaze. The control pathway models the location, orientation, scale, and speed of the attended object. The posterior distribution of these states is estimated with particle filtering. Deeper in the control pathway, we encounter an attentional mechanism that learns to select gazes so as to minimize tracking uncertainty. Unlike in our previous work, we introduce gaze selection strategies that operate in the presence of partial information and on a continuous action space. We show that a straightforward extension of the existing approach to the partial information setting results in poor performance, and we propose an alternative method based on modeling the reward surface as a gaussian process. This approach gives good performance in the presence of partial information and allows us to expand the action space from a small, discrete set of fixation points to a continuous domain.

摘要

我们讨论了一种基于注视数据的同时目标跟踪和识别的注意力模型。受感知理论的启发,该模型由两个相互作用的路径组成,即身份和控制,旨在反映神经科学模型中的“什么”和“哪里”路径。身份路径模型使用深度(因子化)受限玻尔兹曼机来模拟物体的外观并进行分类。在每个时间点,观察结果由注视中心的高分辨率图像和边缘的低分辨率图像组成。控制路径模型的位置、方向、比例和注视物体的速度。这些状态的后验分布使用粒子滤波进行估计。在控制路径的更深处,我们遇到了一种注意力机制,它学会了选择注视点,以最小化跟踪不确定性。与我们之前的工作不同,我们引入了在存在部分信息和连续动作空间的情况下进行注视选择的策略。我们表明,将现有方法直接扩展到部分信息设置会导致性能不佳,我们提出了一种基于将奖励曲面建模为高斯过程的替代方法。这种方法在存在部分信息的情况下表现良好,并允许我们将动作空间从一个小的、离散的固定点集扩展到一个连续的域。

相似文献

1
Learning where to attend with deep architectures for image tracking.学习如何使用深度架构进行图像跟踪。
Neural Comput. 2012 Aug;24(8):2151-84. doi: 10.1162/NECO_a_00312. Epub 2012 Apr 17.
2
View-invariant object category learning, recognition, and search: how spatial and object attention are coordinated using surface-based attentional shrouds.视图不变物体类别学习、识别与搜索:基于表面的注意力罩如何协调空间和物体注意力。
Cogn Psychol. 2009 Feb;58(1):1-48. doi: 10.1016/j.cogpsych.2008.05.001. Epub 2008 Jul 23.
3
Pictorial gaze cues do not enhance long-tailed macaques' performance on a computerised object-location task.图像注视线索并不能提高长尾猕猴在计算机化物体定位任务中的表现。
Behav Processes. 2006 Nov 1;73(3):308-14. doi: 10.1016/j.beproc.2006.07.003. Epub 2006 Jul 25.
4
Object-based attention: shifting or uncertainty?基于客体的注意:转移还是不确定性?
Atten Percept Psychophys. 2010 Oct;72(7):1743-55. doi: 10.3758/APP.72.7.1743.
5
If I saw it, it probably wasn't far from where I was looking.如果我看到了它,那它可能离我当时看的地方不远。
J Vis. 2008 Feb 21;8(2):7.1-10. doi: 10.1167/8.2.7.
6
Learning where to direct gaze during change detection.学习在变化检测过程中注视的方向。
J Vis. 2007 Nov 30;7(14):6.1-12. doi: 10.1167/7.14.6.
7
Design and application of real-time visual attention model for the exploration of 3D virtual environments.三维虚拟环境探索的实时视觉注意模型的设计与应用。
IEEE Trans Vis Comput Graph. 2012 Mar;18(3):356-68. doi: 10.1109/TVCG.2011.154.
8
Dynamic binding of identity and location information: a serial model of multiple identity tracking.身份与位置信息的动态绑定:多身份追踪的序列模型
Cogn Psychol. 2008 Jun;56(4):237-83. doi: 10.1016/j.cogpsych.2007.03.001. Epub 2007 Apr 23.
9
Quantifying and transferring contextual information in object detection.量化和转移目标检测中的上下文信息。
IEEE Trans Pattern Anal Mach Intell. 2012 Apr;34(4):762-77. doi: 10.1109/TPAMI.2011.164.
10
Tracking an object through feature space.通过特征空间跟踪一个物体。
Nature. 2000 Nov 9;408(6809):196-9. doi: 10.1038/35041567.

引用本文的文献

1
A dual foveal-peripheral visual processing model implements efficient saccade selection.一种双中央凹-周边视觉处理模型实现了高效的扫视选择。
J Vis. 2020 Aug 3;20(8):22. doi: 10.1167/jov.20.8.22.
2
A Systematic Review of Artificial Intelligence Techniques in Cancer Prediction and Diagnosis.人工智能技术在癌症预测与诊断中的系统评价
Arch Comput Methods Eng. 2022;29(4):2043-2070. doi: 10.1007/s11831-021-09648-w. Epub 2021 Sep 27.
3
Device-Free Human Identification Using Behavior Signatures in WiFi Sensing.基于 WiFi 感知的行为特征的无设备人体识别
Sensors (Basel). 2021 Sep 3;21(17):5921. doi: 10.3390/s21175921.
4
Service humanoid robotics: a novel interactive system based on bionic-companionship framework.
PeerJ Comput Sci. 2021 Aug 13;7:e674. doi: 10.7717/peerj-cs.674. eCollection 2021.
5
Survey on Deep Neural Networks in Speech and Vision Systems.语音与视觉系统中的深度神经网络调查
Neurocomputing (Amst). 2020 Dec 5;417:302-321. doi: 10.1016/j.neucom.2020.07.053. Epub 2020 Jul 26.
6
TITER: predicting translation initiation sites by deep learning.TITER:通过深度学习预测翻译起始位点。
Bioinformatics. 2017 Jul 15;33(14):i234-i242. doi: 10.1093/bioinformatics/btx247.
7
Invariant visual object recognition: biologically plausible approaches.不变视觉物体识别:生物学上可行的方法。
Biol Cybern. 2015 Oct;109(4-5):505-35. doi: 10.1007/s00422-015-0658-2. Epub 2015 Sep 3.
8
Finding and recognizing objects in natural scenes: complementary computations in the dorsal and ventral visual systems.在自然场景中发现和识别物体:背侧和腹侧视觉系统的互补计算。
Front Comput Neurosci. 2014 Aug 12;8:85. doi: 10.3389/fncom.2014.00085. eCollection 2014.