• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

像素处理器阵列中基于动态模型交换的片上二值化卷积神经网络推理

On-sensor binarized CNN inference with dynamic model swapping in pixel processor arrays.

作者信息

Liu Yanan, Bose Laurie, Fan Rui, Dudek Piotr, Mayol-Cuevas Walterio

机构信息

Bristol Robotics Laboratory, Faculty of Engineering, University of Bristol, Bristol, United Kingdom.

School of Microelectronics, Shanghai University, Shanghai, China.

出版信息

Front Neurosci. 2022 Aug 15;16:909448. doi: 10.3389/fnins.2022.909448. eCollection 2022.

DOI:10.3389/fnins.2022.909448
PMID:36046469
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9421154/
Abstract

Many types of Convolutional Neural Network (CNN) models and training methods have been proposed in recent years aiming to provide efficiency for embedded and edge devices with limited computation and memory resources. The wide variety of architectures makes this a complex task that has to balance generality with efficiency. Among the most interesting camera-sensor architectures are Pixel Processor Arrays (PPAs). This study presents two methods that are useful for embedded CNNs in general but particularly suitable for PPAs. The first is for training purely binarized CNNs, the second is for deploying larger models with a model swapping paradigm that loads model components dynamically. Specifically, this study trains and implements networks with batch normalization and adaptive threshold for binary activations. Then, we convert batch normalization and binary activations into a bias matrix which can be parallelly implemented by an add/sub operation. For dynamic model swapping, we propose to decompose applications that are beyond the capacity of a PPA into sub-tasks that can be solved by tree networks that can be loaded dynamically as needed. We demonstrate our approaches to various tasks including classification, localization, and coarse segmentation on a highly resource constrained PPA sensor-processor.

摘要

近年来,人们提出了多种类型的卷积神经网络(CNN)模型和训练方法,旨在为计算和内存资源有限的嵌入式及边缘设备提供高效性。各种各样的架构使得这成为一项复杂的任务,必须在通用性和效率之间取得平衡。最引人关注的相机传感器架构之一是像素处理器阵列(PPA)。本研究提出了两种方法,它们一般对嵌入式CNN很有用,但特别适用于PPA。第一种方法用于训练纯二值化的CNN,第二种方法用于通过动态加载模型组件的模型交换范式来部署更大的模型。具体而言,本研究训练并实现了具有批量归一化和用于二值激活的自适应阈值的网络。然后,我们将批量归一化和二值激活转换为一个偏差矩阵,该矩阵可以通过加法/减法操作并行实现。对于动态模型交换,我们建议将超出PPA能力的应用分解为子任务,这些子任务可以由树状网络解决,树状网络可以根据需要动态加载。我们在资源高度受限的PPA传感器处理器上展示了我们针对各种任务(包括分类、定位和粗分割)的方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a35/9421154/201a44e5fc42/fnins-16-909448-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a35/9421154/5a81c98b882b/fnins-16-909448-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a35/9421154/201a44e5fc42/fnins-16-909448-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a35/9421154/5a81c98b882b/fnins-16-909448-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a35/9421154/201a44e5fc42/fnins-16-909448-g0003.jpg

相似文献

1
On-sensor binarized CNN inference with dynamic model swapping in pixel processor arrays.像素处理器阵列中基于动态模型交换的片上二值化卷积神经网络推理
Front Neurosci. 2022 Aug 15;16:909448. doi: 10.3389/fnins.2022.909448. eCollection 2022.
2
Sensor-level computer vision with pixel processor arrays for agile robots.具有像素处理器阵列的传感器级计算机视觉,用于敏捷机器人。
Sci Robot. 2022 Jun 29;7(67):eabl7755. doi: 10.1126/scirobotics.abl7755.
3
Optical processor for a binarized neural network.用于二值化神经网络的光处理器。
Opt Lett. 2022 Aug 1;47(15):3892-3895. doi: 10.1364/OL.464214.
4
End-to-End Implementation of a Convolutional Neural Network on a 3D-Integrated Image Sensor with Macropixel Array.端到端在具有宏像素阵列的 3D 集成图像传感器上实现卷积神经网络。
Sensors (Basel). 2023 Feb 8;23(4):1909. doi: 10.3390/s23041909.
5
Hardware-Efficient Stochastic Binary CNN Architectures for Near-Sensor Computing.用于近传感器计算的硬件高效随机二值卷积神经网络架构
Front Neurosci. 2022 Jan 5;15:781786. doi: 10.3389/fnins.2021.781786. eCollection 2021.
6
Towards an Efficient CNN Inference Architecture Enabling In-Sensor Processing.面向支持传感器内处理的高效 CNN 推理架构。
Sensors (Basel). 2021 Mar 10;21(6):1955. doi: 10.3390/s21061955.
7
Pre-Computing Batch Normalisation Parameters for Edge Devices on a Binarized Neural Network.在二值化神经网络上为边缘设备预计算批量归一化参数。
Sensors (Basel). 2023 Jun 14;23(12):5556. doi: 10.3390/s23125556.
8
Visual Odometry Using Pixel Processor Arrays for Unmanned Aerial Systems in GPS Denied Environments.在全球定位系统(GPS)受限环境下,用于无人机系统的基于像素处理器阵列的视觉里程计
Front Robot AI. 2020 Sep 29;7:126. doi: 10.3389/frobt.2020.00126. eCollection 2020.
9
Design of Convolutional Neural Network Processor Based on FPGA Resource Multiplexing Architecture.基于 FPGA 资源复用架构的卷积神经网络处理器设计。
Sensors (Basel). 2022 Aug 10;22(16):5967. doi: 10.3390/s22165967.
10
Researching the CNN Collaborative Inference Mechanism for Heterogeneous Edge Devices.研究异构边缘设备的卷积神经网络协同推理机制。
Sensors (Basel). 2024 Jun 27;24(13):4176. doi: 10.3390/s24134176.

本文引用的文献

1
Visual Odometry Using Pixel Processor Arrays for Unmanned Aerial Systems in GPS Denied Environments.在全球定位系统(GPS)受限环境下,用于无人机系统的基于像素处理器阵列的视觉里程计
Front Robot AI. 2020 Sep 29;7:126. doi: 10.3389/frobt.2020.00126. eCollection 2020.
2
Neural Sensors: Learning Pixel Exposures for HDR Imaging and Video Compressive Sensing With Programmable Sensors.神经传感器:利用可编程传感器学习用于HDR成像和视频压缩感知的像素曝光
IEEE Trans Pattern Anal Mach Intell. 2020 Jul;42(7):1642-1653. doi: 10.1109/TPAMI.2020.2986944. Epub 2020 Apr 13.
3
Fully Convolutional Networks for Semantic Segmentation.
全卷积网络用于语义分割。
IEEE Trans Pattern Anal Mach Intell. 2017 Apr;39(4):640-651. doi: 10.1109/TPAMI.2016.2572683. Epub 2016 May 24.