Division of Information and Computing Technology, College of Science and Engineering, Hamad Bin Khalifa University, Doha P.O. Box 34110, Qatar.
Sensors (Basel). 2023 Jul 6;23(13):6195. doi: 10.3390/s23136195.
While deep learning algorithms have advanced to a great extent, they are all designed for frame-based imagers that capture images at a high frame rate, which leads to a high storage requirement, heavy computations, and very high power consumption. Unlike frame-based imagers, event-based imagers output asynchronous pixel events without the need for global exposure time, therefore lowering both power consumption and latency. In this paper, we propose an innovative image recognition technique that operates on image events rather than frame-based data, paving the way for a new paradigm of recognizing objects prior to image acquisition. To the best of our knowledge, this is the first time such a concept is introduced featuring not only extreme early image recognition but also reduced computational overhead, storage requirement, and power consumption. Our collected event-based dataset using CeleX imager and five public event-based datasets are used to prove this concept, and the testing metrics reflect how early the neural network (NN) detects an image before the full-frame image is captured. It is demonstrated that, on average for all the datasets, the proposed technique recognizes an image 38.7 ms before the first perfect event and 603.4 ms before the last event is received, which is a reduction of 34% and 69% of the time needed, respectively. Further, less processing is required as the image is recognized 9460 events earlier, which is 37% less than waiting for the first perfectly recognized image. An enhanced NN method is also introduced to reduce this time.
虽然深度学习算法已经取得了很大的进展,但它们都是为基于帧的成像仪设计的,这种成像仪以高帧率捕捉图像,这导致了高存储需求、大量计算和非常高的功耗。与基于帧的成像仪不同,事件型成像仪输出异步像素事件,而不需要全局曝光时间,因此降低了功耗和延迟。在本文中,我们提出了一种基于图像事件而不是基于帧数据的创新图像识别技术,为在图像采集之前识别物体开辟了新的范例。据我们所知,这是首次引入这样的概念,不仅具有极端的早期图像识别,而且减少了计算开销、存储需求和功耗。我们使用 CeleX 成像仪和五个公共事件型数据集收集的事件型数据集证明了这一概念,测试指标反映了神经网络(NN)在捕获全帧图像之前多早检测到图像。结果表明,对于所有数据集,平均而言,所提出的技术在首次接收到完美事件之前识别图像的时间提前了 38.7 毫秒,在接收到最后一个事件之前提前了 603.4 毫秒,分别减少了 34%和 69%的时间。此外,由于图像在更早的 9460 个事件中被识别,因此所需的处理更少,比等待第一个完全识别的图像少 37%。我们还引入了一种增强型神经网络方法来减少这种时间。