• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用神经形态架构从关键姿势进行实时生物启发式动作识别。

Real-Time Biologically Inspired Action Recognition from Key Poses Using a Neuromorphic Architecture.

作者信息

Layher Georg, Brosch Tobias, Neumann Heiko

机构信息

Institute of Neural Information Processing, Ulm University Ulm, Germany.

出版信息

Front Neurorobot. 2017 Mar 22;11:13. doi: 10.3389/fnbot.2017.00013. eCollection 2017.

DOI:10.3389/fnbot.2017.00013
PMID:28381998
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5360715/
Abstract

Intelligent agents, such as robots, have to serve a multitude of autonomous functions. Examples are, e.g., collision avoidance, navigation and route planning, active sensing of its environment, or the interaction and non-verbal communication with people in the extended reach space. Here, we focus on the recognition of the action of a human agent based on a biologically inspired visual architecture of analyzing articulated movements. The proposed processing architecture builds upon coarsely segregated streams of sensory processing along different pathways which separately process form and motion information (Layher et al., 2014). Action recognition is performed in an event-based scheme by identifying representations of characteristic pose configurations (key poses) in an image sequence. In line with perceptual studies, key poses are selected unsupervised utilizing a feature-driven criterion which combines extrema in the motion energy with the horizontal and the vertical extendedness of a body shape. Per class representations of key pose frames are learned using a deep convolutional neural network consisting of 15 convolutional layers. The network is trained using the () framework (Esser et al., 2016), which realizes the mapping of the trained synaptic weights onto the platform (Merolla et al., 2014). After the mapping, the trained network achieves real-time capabilities for processing input streams and classify input images at about 1,000 frames per second while the computational stages only consume about 70 mW of energy (without spike transduction). Particularly regarding mobile robotic systems, a low energy profile might be crucial in a variety of application scenarios. Cross-validation results are reported for two different datasets and compared to state-of-the-art action recognition approaches. The results demonstrate, that (I) the presented approach is on par with other key pose based methods described in the literature, which select key pose frames by optimizing classification accuracy, (II) compared to the training on the full set of frames, representations trained on key pose frames result in a higher confidence in class assignments, and (III) key pose representations show promising generalization capabilities in a cross-dataset evaluation.

摘要

智能体,如机器人,必须执行多种自主功能。例如,避撞、导航与路线规划、对其环境的主动感知,或在扩展可达空间内与人员的交互和非语言通信。在此,我们专注于基于一种受生物启发的用于分析关节运动的视觉架构来识别人类智能体的动作。所提出的处理架构基于沿不同路径粗略分离的感官处理流构建,这些路径分别处理形状和运动信息(莱赫尔等人,2014年)。动作识别通过基于事件的方案进行,即识别图像序列中特征姿势配置(关键姿势)的表示。与感知研究一致,关键姿势是利用一种特征驱动标准无监督选择的,该标准将运动能量中的极值与身体形状的水平和垂直延伸相结合。使用由15个卷积层组成的深度卷积神经网络来学习关键姿势帧的每类表示。该网络使用()框架(埃塞尔等人,2016年)进行训练,该框架实现了将训练后的突触权重映射到平台上(梅罗拉等人,2014年)。映射之后,训练后的网络实现了处理输入流的实时能力,并以每秒约1000帧的速度对输入图像进行分类,而计算阶段仅消耗约70毫瓦的能量(无尖峰转换)。特别是对于移动机器人系统,低能耗在各种应用场景中可能至关重要。报告了针对两个不同数据集的交叉验证结果,并与当前最先进的动作识别方法进行了比较。结果表明,(I)所提出的方法与文献中描述的其他基于关键姿势的方法相当,这些方法通过优化分类准确率来选择关键姿势帧;(II)与在完整帧集上进行训练相比,在关键姿势帧上训练的表示在类别分配中具有更高的置信度;(III)关键姿势表示在跨数据集评估中显示出有前景的泛化能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b44/5360715/1dc9be97e2a3/fnbot-11-00013-g0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b44/5360715/308edb8ecc89/fnbot-11-00013-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b44/5360715/08bb93538f88/fnbot-11-00013-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b44/5360715/22db016a29b0/fnbot-11-00013-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b44/5360715/015ae41ffdce/fnbot-11-00013-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b44/5360715/203f68f4c891/fnbot-11-00013-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b44/5360715/b163e6f15ab3/fnbot-11-00013-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b44/5360715/9d933d05c7bb/fnbot-11-00013-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b44/5360715/a686c02655f4/fnbot-11-00013-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b44/5360715/1dc9be97e2a3/fnbot-11-00013-g0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b44/5360715/308edb8ecc89/fnbot-11-00013-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b44/5360715/08bb93538f88/fnbot-11-00013-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b44/5360715/22db016a29b0/fnbot-11-00013-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b44/5360715/015ae41ffdce/fnbot-11-00013-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b44/5360715/203f68f4c891/fnbot-11-00013-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b44/5360715/b163e6f15ab3/fnbot-11-00013-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b44/5360715/9d933d05c7bb/fnbot-11-00013-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b44/5360715/a686c02655f4/fnbot-11-00013-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b44/5360715/1dc9be97e2a3/fnbot-11-00013-g0009.jpg

相似文献

1
Real-Time Biologically Inspired Action Recognition from Key Poses Using a Neuromorphic Architecture.使用神经形态架构从关键姿势进行实时生物启发式动作识别。
Front Neurorobot. 2017 Mar 22;11:13. doi: 10.3389/fnbot.2017.00013. eCollection 2017.
2
Performance of a Computational Model of the Mammalian Olfactory System哺乳动物嗅觉系统计算模型的性能
3
Convolutional networks for fast, energy-efficient neuromorphic computing.用于快速、节能神经形态计算的卷积网络。
Proc Natl Acad Sci U S A. 2016 Oct 11;113(41):11441-11446. doi: 10.1073/pnas.1604850113. Epub 2016 Sep 20.
4
Self-organizing neural integration of pose-motion features for human action recognition.用于人体动作识别的姿态-运动特征自组织神经整合
Front Neurorobot. 2015 Jun 9;9:3. doi: 10.3389/fnbot.2015.00003. eCollection 2015.
5
Engineering Aspects of Olfaction嗅觉的工程学方面
6
Synaptic Device Network Architecture with Feature Extraction for Unsupervised Image Classification.具有特征提取功能的突触器件网络架构用于无监督图像分类
Small. 2018 Aug;14(32):e1800521. doi: 10.1002/smll.201800521. Epub 2018 Jul 15.
7
A neuromorphic system for video object recognition.一种用于视频目标识别的神经形态系统。
Front Comput Neurosci. 2014 Nov 28;8:147. doi: 10.3389/fncom.2014.00147. eCollection 2014.
8
Keys for Action: An Efficient Keyframe-Based Approach for 3D Action Recognition Using a Deep Neural Network.关键动作:基于关键帧的深度学习网络三维动作识别高效方法。
Sensors (Basel). 2020 Apr 15;20(8):2226. doi: 10.3390/s20082226.
9
An Extreme Learning Machine-Based Neuromorphic Tactile Sensing System for Texture Recognition.基于极限学习机的仿生触觉纹理识别传感系统。
IEEE Trans Biomed Circuits Syst. 2018 Apr;12(2):313-325. doi: 10.1109/TBCAS.2018.2805721.
10
DeepDynamicHand: A Deep Neural Architecture for Labeling Hand Manipulation Strategies in Video Sources Exploiting Temporal Information.深度动态手部模型:一种利用时间信息对视频源中的手部操作策略进行标注的深度神经架构。
Front Neurorobot. 2018 Dec 17;12:86. doi: 10.3389/fnbot.2018.00086. eCollection 2018.

引用本文的文献

1
Experimental interpretation of adequate weight-metric combination for dynamic user-based collaborative filtering.基于动态用户的协同过滤中适当权重度量组合的实验解释
PeerJ Comput Sci. 2021 Dec 9;7:e784. doi: 10.7717/peerj-cs.784. eCollection 2021.
2
An Invitation to Greater Use of Matthews Correlation Coefficient in Robotics and Artificial Intelligence.关于在机器人技术和人工智能中更多地使用马修斯相关系数的邀请。
Front Robot AI. 2022 Mar 25;9:876814. doi: 10.3389/frobt.2022.876814. eCollection 2022.
3
Points and Stripes: A Novel Technique for Masking Biological Motion Point-Light Stimuli.

本文引用的文献

1
Convolutional networks for fast, energy-efficient neuromorphic computing.用于快速、节能神经形态计算的卷积网络。
Proc Natl Acad Sci U S A. 2016 Oct 11;113(41):11441-11446. doi: 10.1073/pnas.1604850113. Epub 2016 Sep 20.
2
Deep Neural Networks Reveal a Gradient in the Complexity of Neural Representations across the Ventral Stream.深度神经网络揭示了腹侧流中神经表征复杂性的梯度变化。
J Neurosci. 2015 Jul 8;35(27):10005-14. doi: 10.1523/JNEUROSCI.5023-14.2015.
3
Self-Organization of Spatio-Temporal Hierarchy via Learning of Dynamic Visual Image Patterns on Action Sequences.
点与条纹:一种用于掩盖生物运动点光刺激的新技术。
Front Psychol. 2018 Aug 28;9:1455. doi: 10.3389/fpsyg.2018.01455. eCollection 2018.
通过学习动作序列上的动态视觉图像模式实现时空层次结构的自组织
PLoS One. 2015 Jul 6;10(7):e0131214. doi: 10.1371/journal.pone.0131214. eCollection 2015.
4
Artificial brains. A million spiking-neuron integrated circuit with a scalable communication network and interface.人工大脑。具有可扩展通信网络和接口的 100 万个尖峰神经元集成电路。
Science. 2014 Aug 8;345(6197):668-73. doi: 10.1126/science.1254642. Epub 2014 Aug 7.
5
Responses of Anterior Superior Temporal Polysensory (STPa) Neurons to "Biological Motion" Stimuli.对前上颞多感觉(STPa)神经元对“生物运动”刺激的反应。
J Cogn Neurosci. 1994 Spring;6(2):99-116. doi: 10.1162/jocn.1994.6.2.99.
6
Learning discriminative key poses for action recognition.学习用于动作识别的判别关键姿势。
IEEE Trans Cybern. 2013 Dec;43(6):1860-70. doi: 10.1109/TSMCB.2012.2231959.
7
3D convolutional neural networks for human action recognition.三维卷积神经网络的人体动作识别。
IEEE Trans Pattern Anal Mach Intell. 2013 Jan;35(1):221-31. doi: 10.1109/TPAMI.2012.59.
8
Contributions of form, motion and task to biological motion perception.形态、运动和任务对生物运动感知的作用。
J Vis. 2009 Mar 31;9(3):28.1-11. doi: 10.1167/9.3.28.
9
Recognizing emotions expressed by body pose: a biologically inspired neural model.识别身体姿势所表达的情绪:一种受生物启发的神经模型。
Neural Netw. 2008 Nov;21(9):1238-46. doi: 10.1016/j.neunet.2008.05.003. Epub 2008 Jun 27.
10
Temporal "Bubbles" reveal key features for point-light biological motion perception.颞部“气泡”揭示了点光生物运动感知的关键特征。
J Vis. 2008 Mar 27;8(3):28.1-11. doi: 10.1167/8.3.28.