Tena Felix, Garnica Oscar, Davila Juan Lanchares, Hidalgo J Ignacio
IEEE J Biomed Health Inform. 2023 Aug 1;PP. doi: 10.1109/JBHI.2023.3300511.
This article proposes the first hardware implemen-tation of a low-power LSTM neural network targeting a wearable medical device designed to predict blood glucose at a 30-minute horizon. This work aims to reduce energy consumption by propos-ing new activation functions that target hardware implementation. On top of this proposal, we also prove there is room for improve-ment in energy consumption by applying neural network optimiza-tions at the algorithmic, such as quantization, and architecture level, LSTM hyperparameters, that consider the target hardware. To validate our proposal, we devise an optimized version of the neural network aimed to be wearable and, therefore, to reduce its energy consumption while preserving its accuracy as much as possible. The hardware is implemented on a Xilinx Virtex-7 FPGA VC707 Evaluation Kit. It is compared with (i) a faithful design of the original neural network implemented on the same evaluation kit, (ii) three state-of-the-art LSTM-based FPGA implementations, and (iii) software implementations running in cutting-edge smartphones:OnePlus NordTM and an Apple iPhone 13 ProTM with artificial in-telligence hardware accelerators. Our proposal consumes between ×1020 and ×7 less energy than the software implementations, being the most efficient system compared to the smartphones. On the other hand, its energy efficiency, measured in GFLOP/J, is between ×2.84 and ×7.82 greater than other state-of-the-art LSTM implementations, proving to be the most suitable implementation for a wearable system for blood glucose prediction.
本文提出了针对可穿戴医疗设备的低功耗长短期记忆(LSTM)神经网络的首个硬件实现方案,该设备旨在预测30分钟后的血糖水平。这项工作旨在通过提出针对硬件实现的新激活函数来降低能耗。在此方案之上,我们还证明,通过在算法层面(如量化)和架构层面(考虑目标硬件的LSTM超参数)应用神经网络优化,在能耗方面仍有改进空间。为了验证我们的方案,我们设计了一个优化版的神经网络,目标是实现可穿戴,从而在尽可能保持其准确性的同时降低能耗。硬件在赛灵思Virtex-7 FPGA VC707评估套件上实现。它与以下各项进行了比较:(i)在同一评估套件上实现的原始神经网络的忠实设计;(ii)三种基于LSTM的先进FPGA实现;以及(iii)在前沿智能手机(一加NordTM和配备人工智能硬件加速器的苹果iPhone 13 ProTM)上运行的软件实现。我们的方案比软件实现消耗的能量少1020倍至7倍,与智能手机相比是效率最高的系统。另一方面,以每焦耳千兆浮点运算(GFLOP/J)衡量,其能源效率比其他先进的LSTM实现高2.84倍至7.82倍,证明是用于血糖预测的可穿戴系统的最合适实现方案。