Extreme Early Image Recognition Using Event-Based Vision.

Affiliation

Division of Information and Computing Technology, College of Science and Engineering, Hamad Bin Khalifa University, Doha P.O. Box 34110, Qatar.

Publication information

Sensors (Basel). 2023 Jul 6;23(13):6195. doi: 10.3390/s23136195.

DOI: 10.3390/s23136195

PMID: 37448044

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10346239/

Abstract

While deep learning algorithms have advanced to a great extent, they are all designed for frame-based imagers that capture images at a high frame rate, which leads to a high storage requirement, heavy computations, and very high power consumption. Unlike frame-based imagers, event-based imagers output asynchronous pixel events without the need for global exposure time, therefore lowering both power consumption and latency. In this paper, we propose an innovative image recognition technique that operates on image events rather than frame-based data, paving the way for a new paradigm of recognizing objects prior to image acquisition. To the best of our knowledge, this is the first time such a concept is introduced featuring not only extreme early image recognition but also reduced computational overhead, storage requirement, and power consumption. Our collected event-based dataset using CeleX imager and five public event-based datasets are used to prove this concept, and the testing metrics reflect how early the neural network (NN) detects an image before the full-frame image is captured. It is demonstrated that, on average for all the datasets, the proposed technique recognizes an image 38.7 ms before the first perfect event and 603.4 ms before the last event is received, which is a reduction of 34% and 69% of the time needed, respectively. Further, less processing is required as the image is recognized 9460 events earlier, which is 37% less than waiting for the first perfectly recognized image. An enhanced NN method is also introduced to reduce this time.
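The abstract reports how much earlier an image is recognized, in time and in events, relative to a reference point such as the last event. The following is a minimal sketch of how such figures could be computed for one recording. It is not the authors' code: the function name `early_recognition_stats`, the `EarlyStats` fields, the per-event prediction list, and the choice to count "recognized" as the first correct prediction are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class EarlyStats:
    recognized_index: int        # index of the first event with a correct prediction
    time_saved_ms: float         # reference timestamp minus recognition timestamp
    time_saved_fraction: float   # time saved relative to the reference time
    events_saved: int            # events between recognition and the reference event


def early_recognition_stats(
    timestamps_ms: Sequence[float],
    predictions: Sequence[int],
    true_label: int,
    reference_index: Optional[int] = None,
) -> Optional[EarlyStats]:
    """Compare the first correct prediction against a reference event.

    timestamps_ms[i] is the arrival time of event i, measured from the start
    of the recording. predictions[i] is the (hypothetical) classifier output
    after events 0..i have been accumulated.
    """
    if len(timestamps_ms) != len(predictions):
        raise ValueError("timestamps_ms and predictions must have equal length")
    if not timestamps_ms:
        return None
    if reference_index is None:
        reference_index = len(timestamps_ms) - 1  # default reference: the last event

    # First event, up to the reference, at which the output matches the true label.
    recognized = next(
        (i for i, p in enumerate(predictions[: reference_index + 1]) if p == true_label),
        None,
    )
    if recognized is None:
        return None  # never recognized at or before the reference event

    t_ref = timestamps_ms[reference_index]
    saved = t_ref - timestamps_ms[recognized]
    fraction = saved / t_ref if t_ref > 0 else 0.0
    return EarlyStats(recognized, saved, fraction, reference_index - recognized)


# Toy stream (made-up values): six events, first correct output at event 2.
stats = early_recognition_stats(
    timestamps_ms=[0.0, 10.0, 20.0, 30.0, 40.0, 50.0],
    predictions=[7, 7, 3, 3, 3, 3],
    true_label=3,
)
print(stats)
```

Passing a different `reference_index` (for example, the index of whatever event is treated as the "first perfect event") gives the corresponding comparison. Averaging the resulting values over recordings and datasets would yield summary figures of the kind quoted in the abstract, under the assumptions stated above.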

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a4c/10346239/a011d9681a39/sensors-23-06195-g001.jpg

Similar articles

Mapping from frame-driven to frame-free event-driven vision systems by low-rate rate coding and coincidence processing--application to feedforward ConvNets.

IEEE Trans Pattern Anal Mach Intell. 2013 Nov;35(11):2706-19. doi: 10.1109/TPAMI.2013.71.

Event-Stream Representation for Human Gaits Identification Using Deep Neural Networks.

IEEE Trans Pattern Anal Mach Intell. 2022 Jul;44(7):3436-3449. doi: 10.1109/TPAMI.2021.3054886. Epub 2022 Jun 3.

Fast vision through frameless event-based sensing and convolutional processing: application to texture recognition.

IEEE Trans Neural Netw. 2010 Apr;21(4):609-20. doi: 10.1109/TNN.2009.2039943. Epub 2010 Feb 22.

Event-Based Vision: A Survey.

IEEE Trans Pattern Anal Mach Intell. 2022 Jan;44(1):154-180. doi: 10.1109/TPAMI.2020.3008413. Epub 2021 Dec 7.

FLGR: Fixed Length Gists Representation Learning for RNN-HMM Hybrid-Based Neuromorphic Continuous Gesture Recognition.

Front Neurosci. 2019 Feb 12;13:73. doi: 10.3389/fnins.2019.00073. eCollection 2019.

Progressive Early Image Recognition for Wireless Vision Sensor Networks.

Sensors (Basel). 2022 Aug 24;22(17):6348. doi: 10.3390/s22176348.

Isolated single sound lip-reading using a frame-based camera and event-based camera.

Front Artif Intell. 2023 Jan 11;5:1070964. doi: 10.3389/frai.2022.1070964. eCollection 2022.

Optimizing Deeper Spiking Neural Networks for Dynamic Vision Sensing.

Neural Netw. 2021 Dec;144:686-698. doi: 10.1016/j.neunet.2021.09.022. Epub 2021 Oct 5.

Multi-Stage Network for Event-Based Video Deblurring with Residual Hint Attention.

Sensors (Basel). 2023 Mar 7;23(6):2880. doi: 10.3390/s23062880.
