• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

GAHLS:一种基于优化的图分析的高级综合框架。

GAHLS: an optimized graph analytics based high level synthesis framework.

作者信息

Xiao Yao, Nazarian Shahin, Bogdan Paul

机构信息

University of Southern California, Los Angeles, CA, 90089, USA.

出版信息

Sci Rep. 2023 Dec 19;13(1):22655. doi: 10.1038/s41598-023-48981-x.

DOI:10.1038/s41598-023-48981-x
PMID:38114657
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10730867/
Abstract

The urgent need for low latency, high-compute and low power on-board intelligence in autonomous systems, cyber-physical systems, robotics, edge computing, evolvable computing, and complex data science calls for determining the optimal amount and type of specialized hardware together with reconfigurability capabilities. With these goals in mind, we propose a novel comprehensive graph analytics based high level synthesis (GAHLS) framework that efficiently analyzes complex high level programs through a combined compiler-based approach and graph theoretic optimization and synthesizes them into message passing domain-specific accelerators. This GAHLS framework first constructs a compiler-assisted dependency graph (CaDG) from low level virtual machine (LLVM) intermediate representation (IR) of high level programs and converts it into a hardware friendly description representation. Next, the GAHLS framework performs a memory design space exploration while account for the identified computational properties from the CaDG and optimizing the system performance for higher bandwidth. The GAHLS framework also performs a robust optimization to identify the CaDG subgraphs with similar computational structures and aggregate them into intelligent processing clusters in order to optimize the usage of underlying hardware resources. Finally, the GAHLS framework synthesizes this compressed specialized CaDG into processing elements while optimizing the system performance and area metrics. Evaluations of the GAHLS framework on several real-life applications (e.g., deep learning, brain machine interfaces) demonstrate that it provides 14.27× performance improvements compared to state-of-the-art approaches such as LegUp 6.2.

摘要

在自主系统、信息物理系统、机器人技术、边缘计算、可演化计算和复杂数据科学中,对低延迟、高计算能力和低功耗的板载智能有着迫切需求,这就需要确定专用硬件的最佳数量和类型以及可重构能力。考虑到这些目标,我们提出了一种新颖的基于综合图分析的高级合成(GAHLS)框架,该框架通过基于编译器的方法和图论优化相结合,有效地分析复杂的高级程序,并将它们合成到消息传递领域特定的加速器中。这个GAHLS框架首先从高级程序的低级虚拟机(LLVM)中间表示(IR)构建一个编译器辅助的依赖图(CaDG),并将其转换为硬件友好的描述表示。接下来,GAHLS框架在考虑从CaDG中识别出的计算属性并为更高带宽优化系统性能的同时,进行内存设计空间探索。GAHLS框架还进行了稳健的优化,以识别具有相似计算结构的CaDG子图,并将它们聚合到智能处理集群中,以优化底层硬件资源的使用。最后,GAHLS框架将这个压缩的专用CaDG合成到处理元件中,同时优化系统性能和面积指标。在几个实际应用(如深度学习、脑机接口)上对GAHLS框架进行的评估表明,与诸如LegUp 6.2等现有方法相比,它提供了14.27倍的性能提升。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6566/10730867/f110bf81cab4/41598_2023_48981_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6566/10730867/fa3444888173/41598_2023_48981_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6566/10730867/f62cb90b7b19/41598_2023_48981_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6566/10730867/42068ad61542/41598_2023_48981_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6566/10730867/e011d872c510/41598_2023_48981_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6566/10730867/679568af032f/41598_2023_48981_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6566/10730867/2bea3438436f/41598_2023_48981_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6566/10730867/7381e175c16f/41598_2023_48981_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6566/10730867/604dc45b043a/41598_2023_48981_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6566/10730867/86f849cb8b68/41598_2023_48981_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6566/10730867/f110bf81cab4/41598_2023_48981_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6566/10730867/fa3444888173/41598_2023_48981_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6566/10730867/f62cb90b7b19/41598_2023_48981_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6566/10730867/42068ad61542/41598_2023_48981_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6566/10730867/e011d872c510/41598_2023_48981_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6566/10730867/679568af032f/41598_2023_48981_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6566/10730867/2bea3438436f/41598_2023_48981_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6566/10730867/7381e175c16f/41598_2023_48981_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6566/10730867/604dc45b043a/41598_2023_48981_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6566/10730867/86f849cb8b68/41598_2023_48981_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6566/10730867/f110bf81cab4/41598_2023_48981_Fig9_HTML.jpg

相似文献

1
GAHLS: an optimized graph analytics based high level synthesis framework.GAHLS:一种基于优化的图分析的高级综合框架。
Sci Rep. 2023 Dec 19;13(1):22655. doi: 10.1038/s41598-023-48981-x.
2
At the Confluence of Artificial Intelligence and Edge Computing in IoT-Based Applications: A Review and New Perspectives.在基于物联网应用的人工智能和边缘计算的融合:综述与新视角。
Sensors (Basel). 2023 Feb 2;23(3):1639. doi: 10.3390/s23031639.
3
Closed-Loop Neural Prostheses With On-Chip Intelligence: A Review and a Low-Latency Machine Learning Model for Brain State Detection.闭环神经假体与片上智能:脑状态检测的回顾与低延迟机器学习模型。
IEEE Trans Biomed Circuits Syst. 2021 Oct;15(5):877-897. doi: 10.1109/TBCAS.2021.3112756. Epub 2021 Dec 9.
4
FPGA-Based High-Performance Embedded Systems for Adaptive Edge Computing in Cyber-Physical Systems: The ARTICo³ Framework.基于 FPGA 的用于信息物理系统中自适应边缘计算的高性能嵌入式系统:ARTICo³ 框架。
Sensors (Basel). 2018 Jun 8;18(6):1877. doi: 10.3390/s18061877.
5
Graph Representation Learning Based on Specific Subgraphs for Biomedical Interaction Prediction.基于特定子图的图表示学习用于生物医学相互作用预测
IEEE/ACM Trans Comput Biol Bioinform. 2024 Sep-Oct;21(5):1552-1564. doi: 10.1109/TCBB.2024.3402741. Epub 2024 Oct 9.
6
Graph representation learning in bioinformatics: trends, methods and applications.生物信息学中的图表示学习:趋势、方法和应用。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab340.
7
Neuromorphic Sentiment Analysis Using Spiking Neural Networks.基于尖峰神经网络的神经形态情绪分析。
Sensors (Basel). 2023 Sep 6;23(18):7701. doi: 10.3390/s23187701.
8
From Near-Sensor to In-Sensor: A State-of-the-Art Review of Embedded AI Vision Systems.从近传感器到传感器内:嵌入式人工智能视觉系统的最新综述
Sensors (Basel). 2024 Aug 22;24(16):5446. doi: 10.3390/s24165446.
9
The storage capacity of a directed graph and nodewise autonomous, ubiquitous learning.有向图的存储容量与节点自主、普适学习。
Front Comput Neurosci. 2023 Oct 19;17:1254355. doi: 10.3389/fncom.2023.1254355. eCollection 2023.
10
Deep Reinforcement Learning-Empowered Cost-Effective Federated Video Surveillance Management Framework.深度强化学习赋能的高性价比联邦视频监控管理框架
Sensors (Basel). 2024 Mar 27;24(7):2158. doi: 10.3390/s24072158.

本文引用的文献

1
Opportunities for neuromorphic computing algorithms and applications.神经形态计算算法与应用的机遇。
Nat Comput Sci. 2022 Jan;2(1):10-19. doi: 10.1038/s43588-021-00184-y. Epub 2022 Jan 31.
2
Insect-inspired AI for autonomous robots.受昆虫启发的人工智能用于自主机器人。
Sci Robot. 2022 Jun 15;7(67):eabl6334. doi: 10.1126/scirobotics.abl6334.
3
Deciphering the generating rules and functionalities of complex networks.解析复杂网络的生成规则和功能。
Sci Rep. 2021 Nov 25;11(1):22964. doi: 10.1038/s41598-021-02203-4.
4
The grand challenges of .···的重大挑战。
Sci Robot. 2018 Jan 31;3(14). doi: 10.1126/scirobotics.aar7650.
5
Reliable Multi-Fractal Characterization of Weighted Complex Networks: Algorithms and Implications.可靠的加权复杂网络多重分形特征化:算法与启示。
Sci Rep. 2017 Aug 8;7(1):7487. doi: 10.1038/s41598-017-07209-5.
6
Brain-computer interfaces for communication and rehabilitation.脑机接口用于通信和康复。
Nat Rev Neurol. 2016 Sep;12(9):513-25. doi: 10.1038/nrneurol.2016.113. Epub 2016 Aug 19.
7
Fast and accurate short read alignment with Burrows-Wheeler transform.使用Burrows-Wheeler变换进行快速准确的短读比对。
Bioinformatics. 2009 Jul 15;25(14):1754-60. doi: 10.1093/bioinformatics/btp324. Epub 2009 May 18.
8
Optimization by simulated annealing.模拟退火优化。
Science. 1983 May 13;220(4598):671-80. doi: 10.1126/science.220.4598.671.