Suppr超能文献

TurboBC:一种基于线性代数语言的、内存高效且可扩展的基于GPU的介数中心性算法。

TurboBC: A Memory Efficient and Scalable GPU Based Betweenness Centrality Algorithm in the Language of Linear Algebra.

作者信息

Artiles Oswaldo, Saeed Fahad

机构信息

School of Computing and Information Sciences, Florida, International University, Miami, Florida, USA.

出版信息

Proc Int Workshops Parallel Proc. 2021 Aug;2021. doi: 10.1145/3458744.3474047. Epub 2021 Sep 23.

Abstract

Betweenness centrality (BC) is a shortest path centrality metric used to measure the influence of individual vertices or edges on huge graphs that are used for modeling and analysis of human brain, omics data, or social networks. The application of the BC algorithm to modern graphs must deal with the size of the graphs, as well with highly irregular data-access patterns. These challenges are particularly important when the BC algorithm is implemented on Graphics Processing Units (GPU), due to the limited global memory of these processors, as well as the decrease in performance due to the load unbalance resulting from processing irregular data structures. In this paper, we present the first GPU based linear-algebraic formulation and implementation of BC, called TurboBC, a set of memory efficient BC algorithms that exhibits good performance and high scalability on unweighted, undirected or directed sparse graphs of arbitrary structure. Our experiments demonstrate that our TurboBC algorithms obtain more than 18 GTEPs and an average speedup of 31.9x over the sequential version of the BC algorithm, and are on average 1.7x and 2.2x faster than the state-of-the-art algorithms implemented on the high performance, GPU-based, gunrock, and CPU-based, ligra libraries, respectively. These experiments also show that by minimizing their memory footprint, the TurboBC algorithms are able to compute the BC of relatively big graphs, for which the gunrock algorithms ran out of memory.

摘要

介数中心性(BC)是一种最短路径中心性度量,用于衡量单个顶点或边对用于人类大脑建模与分析、组学数据或社交网络的大型图的影响。将BC算法应用于现代图时,必须处理图的规模以及高度不规则的数据访问模式。当在图形处理单元(GPU)上实现BC算法时,这些挑战尤为重要,因为这些处理器的全局内存有限,而且处理不规则数据结构会导致负载不平衡,进而使性能下降。在本文中,我们提出了首个基于GPU的BC线性代数公式和实现,称为TurboBC,这是一组内存高效的BC算法,在任意结构的无加权、无向或有向稀疏图上表现出良好的性能和高可扩展性。我们的实验表明,我们的TurboBC算法获得了超过18 GTEP的性能,比BC算法的顺序版本平均加速31.9倍,并且分别比基于高性能GPU的gunrock库和基于CPU的ligra库实现的现有算法平均快1.7倍和2.2倍。这些实验还表明,通过最小化内存占用,TurboBC算法能够计算相对较大的图的BC,而gunrock算法在处理这些图时会出现内存不足的情况。

相似文献

2
TurboBFS: GPU Based Breadth-First Search (BFS) Algorithms in the Language of Linear Algebra.TurboBFS:基于GPU的线性代数语言广度优先搜索(BFS)算法
IEEE Int Symp Parallel Distrib Process Workshops Phd Forum. 2021 Jun;2021:520-528. doi: 10.1109/ipdpsw52791.2021.00084. Epub 2021 Jun 24.
3
Fast network centrality analysis using GPUs.利用 GPU 实现快速网络中心性分析。
BMC Bioinformatics. 2011 May 12;12:149. doi: 10.1186/1471-2105-12-149.
10
Querying large graphs in biomedicine with colored graphs and decomposition.用着色图和分解查询生物医学中的大型图
J Biomed Inform. 2020 Aug;108:103503. doi: 10.1016/j.jbi.2020.103503. Epub 2020 Jul 17.

本文引用的文献

1
TurboBFS: GPU Based Breadth-First Search (BFS) Algorithms in the Language of Linear Algebra.TurboBFS:基于GPU的线性代数语言广度优先搜索(BFS)算法
IEEE Int Symp Parallel Distrib Process Workshops Phd Forum. 2021 Jun;2021:520-528. doi: 10.1109/ipdpsw52791.2021.00084. Epub 2021 Jun 24.
2
SNAP: A General Purpose Network Analysis and Graph Mining Library.SNAP:一个通用的网络分析和图挖掘库。
ACM Trans Intell Syst Technol. 2016 Oct;8(1). doi: 10.1145/2898361. Epub 2016 Oct 3.
3
Computational solutions for omics data.计算方法在组学数据中的应用。
Nat Rev Genet. 2013 May;14(5):333-46. doi: 10.1038/nrg3433.
4
Fast network centrality analysis using GPUs.利用 GPU 实现快速网络中心性分析。
BMC Bioinformatics. 2011 May 12;12:149. doi: 10.1186/1471-2105-12-149.
5
Complex network measures of brain connectivity: uses and interpretations.脑连接复杂网络度量:用途与解读。
Neuroimage. 2010 Sep;52(3):1059-69. doi: 10.1016/j.neuroimage.2009.10.003. Epub 2009 Oct 9.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验