
Configurable Multi-Layer Perceptron-Based Soft Sensors on Embedded Field Programmable Gate Arrays: Targeting Diverse Deployment Goals in Fluid Flow Estimation.

Authors

Ling Tianheng, Qian Chao, Klann Theodor Mario, Hoever Julian, Einhaus Lukas, Schiele Gregor

Affiliation

Intelligent Embedded Systems of Computer Science, University of Duisburg-Essen, 47057 Duisburg, Germany.

Publication

Sensors (Basel). 2024 Dec 26;25(1):83. doi: 10.3390/s25010083.

DOI: 10.3390/s25010083
PMID: 39796874
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11722680/
Abstract

This study presents a comprehensive workflow for developing and deploying Multi-Layer Perceptron (MLP)-based soft sensors on embedded FPGAs, addressing diverse deployment objectives. The proposed workflow extends our prior research by introducing greater model adaptability. It supports various configurations (spanning layer counts, neuron counts, and quantization bitwidths) to accommodate the constraints and capabilities of different FPGA platforms. The workflow incorporates a custom-developed, open-source toolchain that facilitates quantization-aware training, integer-only inference, automated accelerator generation using VHDL templates, and synthesis alongside performance estimation. A case study on fluid flow estimation was conducted on two FPGA platforms: the AMD Spartan-7 XC7S15 and the Lattice iCE40UP5K. For precision-focused and latency-sensitive deployments, a six-layer, 60-neuron MLP accelerator quantized to 8 bits on the XC7S15 achieved an MSE of 56.56, a MAPE of 1.61%, and an inference latency of 23.87 μs. Moreover, for low-power and energy-constrained deployments, a five-layer, 30-neuron MLP accelerator quantized to 8 bits on the iCE40UP5K achieved an inference latency of 83.37 μs, a power consumption of 2.06 mW, and an energy consumption of just 0.172 μJ per inference. These results confirm the workflow's ability to identify optimal FPGA accelerators tailored to specific deployment requirements, achieving a balanced trade-off between precision, inference latency, and energy efficiency.
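
The figures quoted in the abstract can be related to each other with a few lines of arithmetic. The sketch below is illustrative only: the function names are hypothetical and are not taken from the authors' toolchain, the metric definitions are the standard textbook ones, and the MAPE helper assumes no true value is zero. It shows how the reported energy per inference follows from the reported power and latency.

```python
def mse(y_true, y_pred):
    """Mean squared error over paired samples (standard definition)."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def mape(y_true, y_pred):
    """Mean absolute percentage error in percent.

    Assumption: no entry of y_true is zero.
    """
    return 100.0 * sum(abs((t - p) / t) for t, p in zip(y_true, y_pred)) / len(y_true)


def energy_per_inference_uj(power_mw, latency_us):
    """Energy in microjoules: mW * us gives nJ, and nJ / 1000 gives uJ."""
    return power_mw * latency_us / 1000.0


# Power and latency as reported in the abstract for the iCE40UP5K deployment.
ice40_energy = energy_per_inference_uj(power_mw=2.06, latency_us=83.37)
print(f"{ice40_energy:.3f} uJ per inference")  # prints "0.172 uJ per inference"
```

Multiplying 2.06 mW by 83.37 μs gives roughly 171.7 nJ, which rounds to the 0.172 μJ stated in the abstract.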


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fe0/11722680/4fdfd5404287/sensors-25-00083-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fe0/11722680/a46ef1c894a3/sensors-25-00083-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fe0/11722680/2246d6036ef7/sensors-25-00083-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fe0/11722680/bc91bd1cd5a4/sensors-25-00083-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fe0/11722680/c485e2547041/sensors-25-00083-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fe0/11722680/7ffdd60fe8fb/sensors-25-00083-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fe0/11722680/886b2b11f939/sensors-25-00083-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fe0/11722680/42220e16f196/sensors-25-00083-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fe0/11722680/a04337e4ea3a/sensors-25-00083-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fe0/11722680/77005cda38be/sensors-25-00083-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fe0/11722680/5dcadbaae22b/sensors-25-00083-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fe0/11722680/7abfa94750a4/sensors-25-00083-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fe0/11722680/7b658d8daf08/sensors-25-00083-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fe0/11722680/f5dab7aae551/sensors-25-00083-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fe0/11722680/bdb5434395b7/sensors-25-00083-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fe0/11722680/d7055cdaadd3/sensors-25-00083-g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fe0/11722680/4bce2bd78274/sensors-25-00083-g017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fe0/11722680/0a8348e82849/sensors-25-00083-g018.jpg

Similar articles

1
Configurable Multi-Layer Perceptron-Based Soft Sensors on Embedded Field Programmable Gate Arrays: Targeting Diverse Deployment Goals in Fluid Flow Estimation.
Sensors (Basel). 2024 Dec 26;25(1):83. doi: 10.3390/s25010083.
2
Quantization-Aware NN Layers with High-throughput FPGA Implementation for Edge AI.
Sensors (Basel). 2023 May 11;23(10):4667. doi: 10.3390/s23104667.
3
Acceleration of Deep Neural Network Training Using Field Programmable Gate Arrays.
Comput Intell Neurosci. 2022 Oct 17;2022:8387364. doi: 10.1155/2022/8387364. eCollection 2022.
4
FPGA-Based Hybrid-Type Implementation of Quantized Neural Networks for Remote Sensing Applications.
Sensors (Basel). 2019 Feb 22;19(4):924. doi: 10.3390/s19040924.
5
Real-time data analysis for medical diagnosis using FPGA-accelerated neural networks.
BMC Bioinformatics. 2018 Dec 21;19(Suppl 18):490. doi: 10.1186/s12859-018-2505-7.
6
AutoMLP: A Framework for the Acceleration of Multi-Layer Perceptron Models on FPGAs for Real-Time Atrial Fibrillation Disease Detection.
IEEE Trans Biomed Circuits Syst. 2023 Dec;17(6):1371-1386. doi: 10.1109/TBCAS.2023.3299084. Epub 2024 Jan 10.
7
Toward Full-Stack Acceleration of Deep Convolutional Neural Networks on FPGAs.
IEEE Trans Neural Netw Learn Syst. 2022 Aug;33(8):3974-3987. doi: 10.1109/TNNLS.2021.3055240. Epub 2022 Aug 3.
8
Flare: An FPGA-Based Full Precision Low Power CNN Accelerator with Reconfigurable Structure.
Sensors (Basel). 2024 Mar 31;24(7):2239. doi: 10.3390/s24072239.
9
A Post-training Quantization Method for the Design of Fixed-Point-Based FPGA/ASIC Hardware Accelerators for LSTM/GRU Algorithms.
Comput Intell Neurosci. 2022 May 11;2022:9485933. doi: 10.1155/2022/9485933. eCollection 2022.
10
A Heterogeneous Hardware Accelerator for Image Classification in Embedded Systems.
Sensors (Basel). 2021 Apr 9;21(8):2637. doi: 10.3390/s21082637.
