• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

边缘设备上用于人工智能应用的异构内存内处理的动态性能与功耗优化

Dynamic Performance and Power Optimization with Heterogeneous Processing-in-Memory for AI Applications on Edge Devices.

作者信息

Jeon Sangmin, Lee Kangju, Lee Kyeongwon, Lee Woojoo

机构信息

Department of Intelligent Semiconductor Engineering, Chung-Ang University, 84, Heukseok-ro, Dongjak-gu, Seoul 06974, Republic of Korea.

出版信息

Micromachines (Basel). 2024 Sep 30;15(10):1222. doi: 10.3390/mi15101222.

DOI:10.3390/mi15101222
PMID:39459096
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11509660/
Abstract

The rapid advancement of artificial intelligence (AI) technology, combined with the widespread proliferation of Internet of Things (IoT) devices, has significantly expanded the scope of AI applications, from data centers to edge devices. Running AI applications on edge devices requires a careful balance between data processing performance and energy efficiency. This challenge becomes even more critical when the computational load of applications dynamically changes over time, making it difficult to maintain optimal performance and energy efficiency simultaneously. To address these challenges, we propose a novel processing-in-memory (PIM) technology that dynamically optimizes performance and power consumption in response to real-time workload variations in AI applications. Our proposed solution consists of a new PIM architecture and an operational algorithm designed to maximize its effectiveness. The PIM architecture follows a well-established structure known for effectively handling data-centric tasks in AI applications. However, unlike conventional designs, it features a heterogeneous configuration of high-performance PIM (HP-PIM) modules and low-power PIM (LP-PIM) modules. This enables the system to dynamically adjust data processing based on varying computational load, optimizing energy efficiency according to the application's workload demands. In addition, we present a data placement optimization algorithm to fully leverage the potential of the heterogeneous PIM architecture. This algorithm predicts changes in application workloads and optimally allocates data to the HP-PIM and LP-PIM modules, improving energy efficiency. To validate and evaluate the proposed technology, we implemented the PIM architecture and developed an embedded processor that integrates this architecture. We performed FPGA prototyping of the processor, and functional verification was successfully completed. Experimental results from running applications with varying workload demands on the prototype PIM processor demonstrate that the proposed technology achieves up to 29.54% energy savings.

摘要

人工智能(AI)技术的迅速发展,再加上物联网(IoT)设备的广泛普及,显著扩大了AI应用的范围,从数据中心到边缘设备。在边缘设备上运行AI应用需要在数据处理性能和能源效率之间仔细权衡。当应用的计算负载随时间动态变化时,这一挑战变得更加关键,使得难以同时维持最佳性能和能源效率。为应对这些挑战,我们提出了一种新颖的内存处理(PIM)技术,该技术可根据AI应用中的实时工作负载变化动态优化性能和功耗。我们提出的解决方案包括一种新的PIM架构和一种旨在最大化其有效性的操作算法。该PIM架构遵循一种成熟的结构,以有效处理AI应用中以数据为中心的任务而闻名。然而,与传统设计不同的是,它具有高性能PIM(HP-PIM)模块和低功耗PIM(LP-PIM)模块的异构配置。这使系统能够根据变化的计算负载动态调整数据处理,根据应用的工作负载需求优化能源效率。此外,我们提出了一种数据放置优化算法,以充分利用异构PIM架构的潜力。该算法预测应用工作负载的变化,并将数据最优地分配到HP-PIM和LP-PIM模块,提高能源效率。为了验证和评估所提出的技术,我们实现了PIM架构并开发了集成该架构的嵌入式处理器。我们对该处理器进行了FPGA原型设计,并成功完成了功能验证。在原型PIM处理器上运行具有不同工作负载需求的应用的实验结果表明,所提出的技术可实现高达29.54%的节能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89d6/11509660/3e39582401cf/micromachines-15-01222-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89d6/11509660/f0428846346f/micromachines-15-01222-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89d6/11509660/65b1f73d0fe3/micromachines-15-01222-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89d6/11509660/3e95866ed1cd/micromachines-15-01222-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89d6/11509660/a96ef5baa849/micromachines-15-01222-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89d6/11509660/0f5c3e8d3f85/micromachines-15-01222-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89d6/11509660/13bc2a2495ee/micromachines-15-01222-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89d6/11509660/6fb37334a4f4/micromachines-15-01222-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89d6/11509660/3e39582401cf/micromachines-15-01222-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89d6/11509660/f0428846346f/micromachines-15-01222-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89d6/11509660/65b1f73d0fe3/micromachines-15-01222-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89d6/11509660/3e95866ed1cd/micromachines-15-01222-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89d6/11509660/a96ef5baa849/micromachines-15-01222-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89d6/11509660/0f5c3e8d3f85/micromachines-15-01222-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89d6/11509660/13bc2a2495ee/micromachines-15-01222-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89d6/11509660/6fb37334a4f4/micromachines-15-01222-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89d6/11509660/3e39582401cf/micromachines-15-01222-g008.jpg

相似文献

1
Dynamic Performance and Power Optimization with Heterogeneous Processing-in-Memory for AI Applications on Edge Devices.边缘设备上用于人工智能应用的异构内存内处理的动态性能与功耗优化
Micromachines (Basel). 2024 Sep 30;15(10):1222. doi: 10.3390/mi15101222.
2
A Processing-in-Memory Architecture Programming Paradigm for Wireless Internet-of-Things Applications.面向无线物联网应用的存内处理体系结构编程范式。
Sensors (Basel). 2019 Jan 3;19(1):140. doi: 10.3390/s19010140.
3
Optimizing Internet of Things Fog Computing: Through Lyapunov-Based Long Short-Term Memory Particle Swarm Optimization Algorithm for Energy Consumption Optimization.优化物联网雾计算:通过基于李雅普诺夫的长短期记忆粒子群优化算法实现能耗优化。
Sensors (Basel). 2024 Feb 10;24(4):1165. doi: 10.3390/s24041165.
4
Design and Implementation of Cloud Analytics-Assisted Smart Power Meters Considering Advanced Artificial Intelligence as Edge Analytics in Demand-Side Management for Smart Homes.考虑将先进人工智能作为智能家居需求侧管理中的边缘分析的云分析辅助智能电表的设计与实现
Sensors (Basel). 2019 May 2;19(9):2047. doi: 10.3390/s19092047.
5
An Ultra-Low Power Reconfigurable Biomedical AI Processor With Adaptive Learning for Versatile Wearable Intelligent Health Monitoring.一种具有自适应学习功能的超低功耗可重构生物医学人工智能处理器,用于多功能可穿戴智能健康监测。
IEEE Trans Biomed Circuits Syst. 2023 Oct;17(5):952-967. doi: 10.1109/TBCAS.2023.3276782. Epub 2023 Nov 21.
6
Using Chiplet Encapsulation Technology to Achieve Processing-in-Memory Functions.使用小芯片封装技术实现内存处理功能。
Micromachines (Basel). 2022 Oct 20;13(10):1790. doi: 10.3390/mi13101790.
7
Artificial Intelligence-Based Semantic Internet of Things in a User-Centric Smart City.以用户为中心的智慧城市中基于人工智能的语义物联网
Sensors (Basel). 2018 Apr 26;18(5):1341. doi: 10.3390/s18051341.
8
Edge Caching Based on Collaborative Filtering for Heterogeneous ICN-IoT Applications.基于协同过滤的边缘缓存用于异构信息中心网络物联网应用
Sensors (Basel). 2021 Aug 15;21(16):5491. doi: 10.3390/s21165491.
9
Research on Convolutional Neural Network Inference Acceleration and Performance Optimization for Edge Intelligence.面向边缘智能的卷积神经网络推理加速与性能优化研究
Sensors (Basel). 2023 Dec 31;24(1):240. doi: 10.3390/s24010240.
10
Load Balancing Using Artificial Intelligence for Cloud-Enabled Internet of Everything in Healthcare Domain.利用人工智能进行负载均衡,实现医疗领域中云支持的万物互联网。
Sensors (Basel). 2023 Jun 5;23(11):5349. doi: 10.3390/s23115349.

本文引用的文献

1
A Survey of Emerging Memory in a Microcontroller Unit.微控制器中新兴存储器的调查。
Micromachines (Basel). 2024 Apr 1;15(4):488. doi: 10.3390/mi15040488.
2
Simulation of a Fully Digital Computing-in-Memory for Non-Volatile Memory for Artificial Intelligence Edge Applications.用于人工智能边缘应用的非易失性存储器全数字内存计算模拟
Micromachines (Basel). 2023 May 31;14(6):1175. doi: 10.3390/mi14061175.
3
At the Confluence of Artificial Intelligence and Edge Computing in IoT-Based Applications: A Review and New Perspectives.
在基于物联网应用的人工智能和边缘计算的融合:综述与新视角。
Sensors (Basel). 2023 Feb 2;23(3):1639. doi: 10.3390/s23031639.
4
TinyML: Enabling of Inference Deep Learning Models on Ultra-Low-Power IoT Edge Devices for AI Applications.TinyML:在超低功耗物联网边缘设备上实现用于人工智能应用的推理深度学习模型。
Micromachines (Basel). 2022 May 29;13(6):851. doi: 10.3390/mi13060851.
5
New Logic-In-Memory Paradigms: An Architectural and Technological Perspective.新型内存逻辑范式:架构与技术视角
Micromachines (Basel). 2019 May 31;10(6):368. doi: 10.3390/mi10060368.