• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多GPU环境下大型数据库表的高效连接算法

Efficient Join Algorithms For Large Database Tables in a Multi-GPU Environment.

作者信息

Rui Ran, Li Hao, Tu Yi-Cheng

机构信息

University of South Florida, Tampa, Florida, USA.

出版信息

Proceedings VLDB Endowment. 2020 Dec;14(4):708-720. doi: 10.14778/3436905.3436927. Epub 2020 Dec 1.

DOI:10.14778/3436905.3436927
PMID:38260211
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10803061/
Abstract

Relational join processing is one of the core functionalities in database management systems. It has been demonstrated that GPUs as a general-purpose parallel computing platform is very promising in processing relational joins. However, join algorithms often need to handle very large input data, which is an issue that was not sufficiently addressed in existing work. Besides, as more and more desktop and workstation platforms support multi-GPU environment, the combined computing capability of multiple GPUs can easily achieve that of a computing cluster. It is worth exploring how join processing would benefit from the adaptation of multiple GPUs. We identify the low rate and complex patterns of data transfer among the CPU and GPUs as the main challenges in designing efficient algorithms for large table joins. To overcome such challenges, we propose three distinctive designs of multi-GPU join algorithms, namely, the nested loop, global sort-merge and hybrid joins for large table joins with different join conditions. Extensive experiments running on multiple databases and two different hardware configurations demonstrate high scalability of our algorithms over data size and significant performance boost brought by the use of multiple GPUs. Furthermore, our algorithms achieve much better performance as compared to existing join algorithms, with a speedup up to 25X and 2.8X over best known code developed for multi-core CPUs and GPUs respectively.

摘要

关系连接处理是数据库管理系统的核心功能之一。事实证明,图形处理器(GPU)作为通用并行计算平台,在处理关系连接方面很有前景。然而,连接算法通常需要处理非常大的输入数据,而这一问题在现有工作中并未得到充分解决。此外,随着越来越多的桌面和工作站平台支持多GPU环境,多个GPU的组合计算能力很容易达到计算集群的水平。探索连接处理如何从多GPU的应用中受益是值得的。我们将CPU和GPU之间低速率且复杂的数据传输模式确定为设计大型表连接高效算法的主要挑战。为了克服这些挑战,我们针对具有不同连接条件的大型表连接,提出了三种独特的多GPU连接算法设计,即嵌套循环、全局排序合并和混合连接。在多个数据库和两种不同硬件配置上进行的大量实验表明,我们的算法在数据大小方面具有很高的可扩展性,并且使用多个GPU带来了显著的性能提升。此外,与现有连接算法相比,我们的算法性能要好得多,分别比为多核CPU和GPU开发的最佳已知代码加速25倍和2.8倍。

相似文献

1
Efficient Join Algorithms For Large Database Tables in a Multi-GPU Environment.多GPU环境下大型数据库表的高效连接算法
Proceedings VLDB Endowment. 2020 Dec;14(4):708-720. doi: 10.14778/3436905.3436927. Epub 2020 Dec 1.
2
Fast Equi-Join Algorithms on GPUs: Design and Implementation.基于图形处理器的快速等值连接算法:设计与实现
Sci Stat Database Manag. 2017 Jun;2017. doi: 10.1145/3085504.3085521. Epub 2017 Jun 27.
3
Parallel beamlet dose calculation via beamlet contexts in a distributed multi-GPU framework.基于分布式多 GPU 框架中的束流子区域进行平行束流子剂量计算。
Med Phys. 2019 Aug;46(8):3719-3733. doi: 10.1002/mp.13651. Epub 2019 Jun 30.
4
Accelerating Coupled-Cluster Calculations with GPUs: An Implementation of the Density-Fitted CCSD(T) Approach for Heterogeneous Computing Architectures Using OpenMP Directives.利用GPU加速耦合簇计算:一种使用OpenMP指令在异构计算架构上实现密度拟合CCSD(T)方法的方案
J Chem Theory Comput. 2023 Nov 14;19(21):7640-7657. doi: 10.1021/acs.jctc.3c00876. Epub 2023 Oct 25.
5
Comprehensive framework of GPU-accelerated image reconstruction for photoacoustic computed tomography.GPU 加速的光声计算机断层成像图像重建的综合框架。
J Biomed Opt. 2024 Jun;29(6):066006. doi: 10.1117/1.JBO.29.6.066006. Epub 2024 Jun 6.
6
Accelerating epistasis analysis in human genetics with consumer graphics hardware.利用消费级图形硬件加速人类遗传学中的上位性分析。
BMC Res Notes. 2009 Jul 24;2:149. doi: 10.1186/1756-0500-2-149.
7
Multi-GPU implementation of a VMAT treatment plan optimization algorithm.容积调强放疗(VMAT)治疗计划优化算法的多图形处理器(Multi-GPU)实现
Med Phys. 2015 Jun;42(6):2841-52. doi: 10.1118/1.4919742.
8
Accelerating Large Scale Image Analyses on Parallel, CPU-GPU Equipped Systems.在配备了CPU - GPU的并行系统上加速大规模图像分析
Proc IPDPS (Conf). 2012 May;2012:1093-1104. doi: 10.1109/IPDPS.2012.101.
9
NMF-mGPU: non-negative matrix factorization on multi-GPU systems.NMF-mGPU:多GPU系统上的非负矩阵分解
BMC Bioinformatics. 2015 Feb 13;16:43. doi: 10.1186/s12859-015-0485-4.
10
Efficient Irregular Wavefront Propagation Algorithms on Hybrid CPU-GPU Machines.混合CPU-GPU机器上的高效不规则波前传播算法
Parallel Comput. 2013 Apr 1;39(4-5):189-211. doi: 10.1016/j.parco.2013.03.001.

引用本文的文献

1
Dynamic Buffer Management in Massively Parallel Systems: The Power of Randomness.大规模并行系统中的动态缓冲区管理:随机性的力量
ACM Trans Parallel Comput. 2025 Mar;12(1). doi: 10.1145/3701623. Epub 2025 Feb 11.
2
ArcaDB: A Disaggregated Query Engine for Heterogenous Computational Environments.ArcaDB:一种用于异构计算环境的分布式查询引擎。
IEEE Int Conf Cloud Comput. 2024 Jul;2024:42-53. doi: 10.1109/cloud62652.2024.00015. Epub 2024 Aug 28.

本文引用的文献

1
Fast Equi-Join Algorithms on GPUs: Design and Implementation.基于图形处理器的快速等值连接算法:设计与实现
Sci Stat Database Manag. 2017 Jun;2017. doi: 10.1145/3085504.3085521. Epub 2017 Jun 27.