HARP: Hierarchical Attention Oriented Region-Based Processing for High-Performance Computation in Vision Sensor.

Authors

Bhowmik Pankaj, Pantho Md Jubaer Hossain, Bobda Christophe

Affiliation

Electrical and Computer Engineering Department, University of Florida, Gainesville, FL 32603, USA.

Publication information

Sensors (Basel). 2021 Mar 4;21(5):1757. doi: 10.3390/s21051757.

Abstract

Advances in complementary metal-oxide-semiconductor (CMOS) image sensors have made high-quality cameras ubiquitous, and vision applications commonly offload their computation to the cloud. This raises concerns for time-critical applications such as autonomous driving, surveillance, and defense systems, because moving pixels off the sensor's focal plane is expensive. This paper presents a hardware architecture for smart cameras that identifies the salient regions of an image frame and then performs high-level inference to create information at the sensor level instead of transporting raw pixels. A visual attention-oriented computational strategy filters out a significant amount of the redundant spatiotemporal data collected at the focal plane. A computationally expensive learning model is then applied to the regions of interest. The hierarchical processing in the pixels' data path forms a bottom-up architecture with massive parallelism and achieves high throughput by exploiting the large bandwidth available at the image source. We prototype the model on a field-programmable gate array (FPGA) and as an application-specific integrated circuit (ASIC) for integration with a pixel-parallel image sensor. The experimental results show that our approach achieves significant speedup and, under certain conditions, up to 45% better energy efficiency with attention-oriented processing. Although incorporating attention-oriented processing incurs an area overhead, the gains in energy consumption, latency, and memory utilization outweigh that cost.
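The two-stage idea in the abstract (a cheap attention stage that discards redundant data, followed by an expensive model on the surviving regions) can be illustrated in software. The sketch below is a minimal illustration only, not the authors' hardware design. The abstract does not specify the attention mechanism, so temporal differencing per tile is used here as a stand-in, and the tile size, the threshold, and every function name (`tile_activity`, `salient_tiles`, `expensive_model`, `process_frame`) are assumptions made for the example.

```python
# Illustrative sketch only: the tile size, threshold, and all function names
# are assumptions for this example and are not taken from the HARP paper.

def tile_activity(prev, curr, r0, c0, size):
    """Mean absolute temporal difference over one size x size tile."""
    total = 0
    for r in range(r0, r0 + size):
        for c in range(c0, c0 + size):
            total += abs(curr[r][c] - prev[r][c])
    return total / (size * size)


def salient_tiles(prev, curr, size=2, threshold=10.0):
    """Return (row, col) origins of tiles whose activity exceeds threshold."""
    rows, cols = len(curr), len(curr[0])
    return [
        (r0, c0)
        for r0 in range(0, rows - size + 1, size)
        for c0 in range(0, cols - size + 1, size)
        if tile_activity(prev, curr, r0, c0, size) > threshold
    ]


def expensive_model(tile):
    """Hypothetical placeholder for the costly inference stage (tile mean)."""
    return sum(sum(row) for row in tile) / (len(tile) * len(tile[0]))


def process_frame(prev, curr, size=2, threshold=10.0):
    """Run the placeholder model on salient tiles only; skip all others."""
    results = {}
    for r0, c0 in salient_tiles(prev, curr, size, threshold):
        tile = [row[c0:c0 + size] for row in curr[r0:r0 + size]]
        results[(r0, c0)] = expensive_model(tile)
    return results


# A 4x4 frame in which only the bottom-right 2x2 tile changes.
prev_frame = [[0] * 4 for _ in range(4)]
curr_frame = [row[:] for row in prev_frame]
for r, c in [(2, 2), (2, 3), (3, 2), (3, 3)]:
    curr_frame[r][c] = 100

print(process_frame(prev_frame, curr_frame))  # {(2, 2): 100.0}
```

In this toy frame, three of the four tiles are unchanged and never reach `expensive_model`, which is the kind of saving the attention stage is meant to provide. In the paper this filtering is done in parallel hardware near the pixel array; the sequential loops here only show the data flow.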

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8da/7961745/31672c95ad96/sensors-21-01757-g001.jpg
