FLGR：用于基于RNN-HMM混合模型的神经形态连续手势识别的定长要点表示学习

FLGR: Fixed Length Gists Representation Learning for RNN-HMM Hybrid-Based Neuromorphic Continuous Gesture Recognition.

作者信息

Chen Guang, Chen Jieneng, Lienen Marten, Conradt Jörg, Röhrbein Florian, Knoll Alois C

机构信息

College of Automotive Engineering, Tongji University, Shanghai, China.

Chair of Robotics, Artificial Intelligence and Real-time Systems, Technische Universität München, Munich, Germany.

出版信息

Front Neurosci. 2019 Feb 12;13:73. doi: 10.3389/fnins.2019.00073. eCollection 2019.

DOI:10.3389/fnins.2019.00073

PMID:30809114

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6380225/

Abstract

A neuromorphic vision sensors is a novel passive sensing modality and frameless sensors with several advantages over conventional cameras. Frame-based cameras have an average frame-rate of 30 fps, causing motion blur when capturing fast motion, e.g., hand gesture. Rather than wastefully sending entire images at a fixed frame rate, neuromorphic vision sensors only transmit the local pixel-level changes induced by the movement in a scene when they occur. This leads to advantageous characteristics, including low energy consumption, high dynamic range, a sparse event stream and low response latency. In this study, a novel representation learning method was proposed: Fixed Length Gists Representation (FLGR) learning for event-based gesture recognition. Previous methods accumulate events into video frames in a time duration (e.g., 30 ms) to make the accumulated image-level representation. However, the accumulated-frame-based representation waives the friendly event-driven paradigm of neuromorphic vision sensor. New representation are urgently needed to fill the gap in non-accumulated-frame-based representation and exploit the further capabilities of neuromorphic vision. The proposed FLGR is a sequence learned from mixture density autoencoder and preserves the nature of event-based data better. FLGR has a data format of fixed length, and it is easy to feed to sequence classifier. Moreover, an RNN-HMM hybrid was proposed to address the continuous gesture recognition problem. Recurrent neural network (RNN) was applied for FLGR sequence classification while hidden Markov model (HMM) is employed for localizing the candidate gesture and improving the result in a continuous sequence. A neuromorphic continuous hand gestures dataset (Neuro ConGD Dataset) was developed with 17 hand gestures classes for the community of the neuromorphic research. Hopefully, FLGR can inspire the study on the event-based highly efficient, high-speed, and high-dynamic-range sequence classification tasks.

摘要

神经形态视觉传感器是一种新型的无源传感模式和无帧传感器，与传统相机相比具有多个优势。基于帧的相机平均帧率为30帧/秒，在捕捉快速动作（如手势）时会产生运动模糊。神经形态视觉传感器不会以固定帧率浪费地发送完整图像，而是仅在场景中的运动引起局部像素级变化发生时进行传输。这带来了包括低能耗、高动态范围、稀疏事件流和低响应延迟等优势特性。在本研究中，提出了一种新颖的表征学习方法：用于基于事件的手势识别的固定长度要点表征（FLGR）学习。先前的方法在一段时间（如30毫秒）内将事件累积到视频帧中，以生成累积的图像级表征。然而，基于累积帧的表征放弃了神经形态视觉传感器友好的事件驱动范式。迫切需要新的表征来填补基于非累积帧表征的空白，并挖掘神经形态视觉的进一步能力。所提出的FLGR是从混合密度自动编码器学习到的序列，能更好地保留基于事件的数据的本质。FLGR具有固定长度的数据格式，易于输入到序列分类器中。此外，还提出了一种循环神经网络（RNN）与隐马尔可夫模型（HMM）的混合模型来解决连续手势识别问题。循环神经网络用于FLGR序列分类，而隐马尔可夫模型用于定位候选手势并在连续序列中改进结果。为神经形态研究领域开发了一个包含17种手势类别的神经形态连续手势数据集（Neuro ConGD Dataset）。希望FLGR能激发对基于事件的高效、高速和高动态范围序列分类任务的研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9791/6380225/37db232eac7a/fnins-13-00073-g0002.jpg

相似文献

FLGR: Fixed Length Gists Representation Learning for RNN-HMM Hybrid-Based Neuromorphic Continuous Gesture Recognition.

Front Neurosci. 2019 Feb 12;13:73. doi: 10.3389/fnins.2019.00073. eCollection 2019.

Hand-Gesture Recognition Based on EMG and Event-Based Camera Sensor Fusion: A Benchmark in Neuromorphic Computing.

Front Neurosci. 2020 Aug 5;14:637. doi: 10.3389/fnins.2020.00637. eCollection 2020.

CIFAR10-DVS: An Event-Stream Dataset for Object Classification.

Front Neurosci. 2017 May 30;11:309. doi: 10.3389/fnins.2017.00309. eCollection 2017.

Real-time gesture interface based on event-driven processing from stereo silicon retinas.

IEEE Trans Neural Netw Learn Syst. 2014 Dec;25(12):2250-63. doi: 10.1109/TNNLS.2014.2308551.

HAGR-D: A Novel Approach for Gesture Recognition with Depth Maps.

Sensors (Basel). 2015 Nov 12;15(11):28646-64. doi: 10.3390/s151128646.

Neuromorphic-PM: processing-in-pixel-in-memory paradigm for neuromorphic image sensors.

Front Neuroinform. 2023 May 4;17:1144301. doi: 10.3389/fninf.2023.1144301. eCollection 2023.

ES-ImageNet: A Million Event-Stream Classification Dataset for Spiking Neural Networks.

Front Neurosci. 2021 Nov 25;15:726582. doi: 10.3389/fnins.2021.726582. eCollection 2021.

Tracking and Classification of In-Air Hand Gesture Based on Thermal Guided Joint Filter.

Sensors (Basel). 2017 Jan 17;17(1):166. doi: 10.3390/s17010166.

MFA-Net: Motion Feature Augmented Network for Dynamic Hand Gesture Recognition from Skeletal Data.

Sensors (Basel). 2019 Jan 10;19(2):239. doi: 10.3390/s19020239.

A New Spiking Convolutional Recurrent Neural Network (SCRNN) With Applications to Event-Based Hand Gesture Recognition.

Front Neurosci. 2020 Nov 17;14:590164. doi: 10.3389/fnins.2020.590164. eCollection 2020.

引用本文的文献

Event-Based Optical Flow Estimation with Spatio-Temporal Backpropagation Trained Spiking Neural Network.

Micromachines (Basel). 2023 Jan 13;14(1):203. doi: 10.3390/mi14010203.

Event-Based Gesture Recognition With Dynamic Background Suppression Using Smartphone Computational Capabilities.

Front Neurosci. 2020 Apr 9;14:275. doi: 10.3389/fnins.2020.00275. eCollection 2020.

本文引用的文献

Spatio-Temporal Backpropagation for Training High-Performance Spiking Neural Networks.

Front Neurosci. 2018 May 23;12:331. doi: 10.3389/fnins.2018.00331. eCollection 2018.

DVS Benchmark Datasets for Object Tracking, Action Recognition, and Object Recognition.

Front Neurosci. 2016 Aug 31;10:405. doi: 10.3389/fnins.2016.00405. eCollection 2016.

Deep Dynamic Neural Networks for Multimodal Gesture Segmentation and Recognition.

IEEE Trans Pattern Anal Mach Intell. 2016 Aug;38(8):1583-97. doi: 10.1109/TPAMI.2016.2537340. Epub 2016 Mar 2.

Real-time gesture interface based on event-driven processing from stereo silicon retinas.

IEEE Trans Neural Netw Learn Syst. 2014 Dec;25(12):2250-63. doi: 10.1109/TNNLS.2014.2308551.

Robotic goalie with 3 ms reaction time at 4% CPU load using event-based dynamic vision sensor.

Front Neurosci. 2013 Nov 21;7:223. doi: 10.3389/fnins.2013.00223. eCollection 2013.

3D convolutional neural networks for human action recognition.

IEEE Trans Pattern Anal Mach Intell. 2013 Jan;35(1):221-31. doi: 10.1109/TPAMI.2012.59.

Long short-term memory.

Neural Comput. 1997 Nov 15;9(8):1735-80. doi: 10.1162/neco.1997.9.8.1735.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

FLGR：用于基于RNN-HMM混合模型的神经形态连续手势识别的定长要点表示学习

FLGR: Fixed Length Gists Representation Learning for RNN-HMM Hybrid-Based Neuromorphic Continuous Gesture Recognition.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献