• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Efficient Irregular Wavefront Propagation Algorithms on Hybrid CPU-GPU Machines.混合CPU-GPU机器上的高效不规则波前传播算法
Parallel Comput. 2013 Apr 1;39(4-5):189-211. doi: 10.1016/j.parco.2013.03.001.
2
Cooperative and out-of-core execution of the irregular wavefront propagation pattern on hybrid machines with Intel Xeon Phi™.在配备英特尔至强融核™的混合机上对不规则波前传播模式进行协同和核外执行。
Concurr Comput. 2018 Jul 25;30(14). doi: 10.1002/cpe.4425. Epub 2018 Jan 24.
3
Efficient irregular wavefront propagation algorithms on Intel Xeon Phi.英特尔至强融核处理器上的高效不规则波前传播算法
Proc Symp Comput Archit High Perform Comput. 2015 Oct;2015:25-32. doi: 10.1109/SBAC-PAD.2015.13.
4
Application Performance Analysis and Efficient Execution on Systems with multi-core CPUs, GPUs and MICs: A Case Study with Microscopy Image Analysis.多核CPU、GPU和MIC系统上的应用性能分析与高效执行:以显微镜图像分析为例
Int J High Perform Comput Appl. 2017 Jan;31(1):32-51. doi: 10.1177/1094342015594519. Epub 2015 Jul 27.
5
A fast forward projection using multithreads for multirays on GPUs in medical image reconstruction.基于 GPU 的医学图像重建中多线程快速前向投影的多射线算法。
Med Phys. 2011 Jul;38(7):4052-65. doi: 10.1118/1.3591994.
6
High-throughput Analysis of Large Microscopy Image Datasets on CPU-GPU Cluster Platforms.在CPU-GPU集群平台上对大型显微镜图像数据集进行高通量分析。
Proc IPDPS (Conf). 2013 May;2013:103-114. doi: 10.1109/IPDPS.2013.11.
7
Accelerating Spatial Cross-Matching on CPU-GPU Hybrid Platform With CUDA and OpenACC.利用CUDA和OpenACC在CPU-GPU混合平台上加速空间交叉匹配
Front Big Data. 2020 May;3. doi: 10.3389/fdata.2020.00014. Epub 2020 May 8.
8
A hybrid CPU/GPU method for Hartree-Fock self-consistent-field calculation.一种用于哈特里-福克自洽场计算的混合CPU/ GPU方法。
J Chem Phys. 2023 Sep 14;159(10). doi: 10.1063/5.0156934.
9
Hybrid computing: CPU+GPU co-processing and its application to tomographic reconstruction.混合计算:CPU+GPU 协同处理及其在断层重建中的应用。
Ultramicroscopy. 2012 Apr;115:109-14. doi: 10.1016/j.ultramic.2012.02.003. Epub 2012 Feb 18.
10
CUDASW++ 3.0: accelerating Smith-Waterman protein database search by coupling CPU and GPU SIMD instructions.CUDASW++ 3.0:通过结合 CPU 和 GPU 的 SIMD 指令来加速 Smith-Waterman 蛋白质数据库搜索。
BMC Bioinformatics. 2013 Apr 4;14:117. doi: 10.1186/1471-2105-14-117.

引用本文的文献

1
Optimizing parameter sensitivity analysis of large-scale microscopy image analysis workflows with multilevel computation reuse.利用多级计算复用优化大规模显微镜图像分析工作流程的参数敏感性分析。
Concurr Comput. 2020 Jan 25;32(2). doi: 10.1002/cpe.5403. Epub 2019 Jun 24.
2
Cooperative and out-of-core execution of the irregular wavefront propagation pattern on hybrid machines with Intel Xeon Phi™.在配备英特尔至强融核™的混合机上对不规则波前传播模式进行协同和核外执行。
Concurr Comput. 2018 Jul 25;30(14). doi: 10.1002/cpe.4425. Epub 2018 Jan 24.
3
Application Performance Analysis and Efficient Execution on Systems with multi-core CPUs, GPUs and MICs: A Case Study with Microscopy Image Analysis.多核CPU、GPU和MIC系统上的应用性能分析与高效执行:以显微镜图像分析为例
Int J High Perform Comput Appl. 2017 Jan;31(1):32-51. doi: 10.1177/1094342015594519. Epub 2015 Jul 27.
4
Efficient irregular wavefront propagation algorithms on Intel Xeon Phi.英特尔至强融核处理器上的高效不规则波前传播算法
Proc Symp Comput Archit High Perform Comput. 2015 Oct;2015:25-32. doi: 10.1109/SBAC-PAD.2015.13.
5
Efficient Execution of Microscopy Image Analysis on CPU, GPU, and MIC Equipped Cluster Systems.在配备CPU、GPU和MIC的集群系统上高效执行显微镜图像分析
Proc Symp Comput Archit High Perform Comput. 2014 Oct;2014:89-96. doi: 10.1109/SBAC-PAD.2014.15.
6
Scalable analysis of Big pathology image data cohorts using efficient methods and high-performance computing strategies.使用高效方法和高性能计算策略对大型病理学图像数据群组进行可扩展分析。
BMC Bioinformatics. 2015 Dec 1;16:399. doi: 10.1186/s12859-015-0831-6.
7
Region Templates: Data Representation and Management for High-Throughput Image Analysis.区域模板:用于高通量图像分析的数据表示与管理
Parallel Comput. 2014 Dec 1;40(10):589-610. doi: 10.1016/j.parco.2014.09.003.
8
High-throughput Analysis of Large Microscopy Image Datasets on CPU-GPU Cluster Platforms.在CPU-GPU集群平台上对大型显微镜图像数据集进行高通量分析。
Proc IPDPS (Conf). 2013 May;2013:103-114. doi: 10.1109/IPDPS.2013.11.
9
Comparative Performance Analysis of Intel Xeon Phi, GPU, and CPU: A Case Study from Microscopy Image Analysis.英特尔至强融核处理器、图形处理器和中央处理器的性能对比分析:来自显微镜图像分析的案例研究
IEEE Trans Parallel Distrib Syst. 2014 May;2014:1063-1072. doi: 10.1109/IPDPS.2014.111.
10
High-Performance Computational Analysis of Glioblastoma Pathology Images with Database Support Identifies Molecular and Survival Correlates.基于数据库支持的胶质母细胞瘤病理图像高性能计算分析确定分子与生存相关性
Proceedings (IEEE Int Conf Bioinformatics Biomed). 2013 Dec:229-236. doi: 10.1109/BIBM.2013.6732495.

本文引用的文献

1
Accelerating Large Scale Image Analyses on Parallel, CPU-GPU Equipped Systems.在配备了CPU - GPU的并行系统上加速大规模图像分析
Proc IPDPS (Conf). 2012 May;2012:1093-1104. doi: 10.1109/IPDPS.2012.101.
2
Morphological grayscale reconstruction in image analysis: applications and efficient algorithms.图像分析中的形态学灰度重建:应用与高效算法。
IEEE Trans Image Process. 1993;2(2):176-201. doi: 10.1109/83.217222.
3
The efficient algorithms for achieving Euclidean distance transformation.用于实现欧几里得距离变换的高效算法。
IEEE Trans Image Process. 2004 Aug;13(8):1078-91. doi: 10.1109/tip.2004.826098.

混合CPU-GPU机器上的高效不规则波前传播算法

Efficient Irregular Wavefront Propagation Algorithms on Hybrid CPU-GPU Machines.

作者信息

Teodoro George, Pan Tony, Kurc Tahsin, Kong Jun, Cooper Lee, Saltz Joel

机构信息

Center for Comprehensive Informatics and Biomedical Informatics Department, Emory University, Atlanta, GA 30322.

出版信息

Parallel Comput. 2013 Apr 1;39(4-5):189-211. doi: 10.1016/j.parco.2013.03.001.

DOI:10.1016/j.parco.2013.03.001
PMID:23908562
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3727669/
Abstract

We address the problem of efficient execution of a computation pattern, referred to here as the irregular wavefront propagation pattern (IWPP), on hybrid systems with multiple CPUs and GPUs. The IWPP is common in several image processing operations. In the IWPP, data elements in the wavefront propagate waves to their neighboring elements on a grid if a propagation condition is satisfied. Elements receiving the propagated waves become part of the wavefront. This pattern results in irregular data accesses and computations. We develop and evaluate strategies for efficient computation and propagation of wavefronts using a multi-level queue structure. This queue structure improves the utilization of fast memories in a GPU and reduces synchronization overheads. We also develop a tile-based parallelization strategy to support execution on multiple CPUs and GPUs. We evaluate our approaches on a state-of-the-art GPU accelerated machine (equipped with 3 GPUs and 2 multicore CPUs) using the IWPP implementations of two widely used image processing operations: morphological reconstruction and euclidean distance transform. Our results show significant performance improvements on GPUs. The use of multiple CPUs and GPUs cooperatively attains speedups of 50× and 85× with respect to single core CPU executions for morphological reconstruction and euclidean distance transform, respectively.

摘要

我们解决了在具有多个CPU和GPU的混合系统上高效执行一种计算模式(这里称为不规则波前传播模式,即IWPP)的问题。IWPP在多种图像处理操作中很常见。在IWPP中,如果满足传播条件,波前中的数据元素会将波传播到网格上的相邻元素。接收到传播波的元素会成为波前的一部分。这种模式会导致不规则的数据访问和计算。我们开发并评估了使用多级队列结构来高效计算和传播波前的策略。这种队列结构提高了GPU中快速内存的利用率,并减少了同步开销。我们还开发了一种基于瓦片的并行化策略,以支持在多个CPU和GPU上执行。我们使用两种广泛使用的图像处理操作(形态学重建和欧几里得距离变换)的IWPP实现,在一台先进的GPU加速机器(配备3个GPU和2个多核CPU)上评估我们的方法。我们的结果表明在GPU上有显著的性能提升。对于形态学重建和欧几里得距离变换,分别使用多个CPU和GPU协同实现了相对于单核CPU执行50倍和85倍的加速。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/4d4bd76a6808/nihms-439013-f0016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/eb34d737ef3c/nihms-439013-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/81a477fa19a6/nihms-439013-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/d5c0bda56c07/nihms-439013-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/9b475b68262a/nihms-439013-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/4116f4e3ae13/nihms-439013-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/6e24e371c6d6/nihms-439013-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/89f3c9b3f787/nihms-439013-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/5a0139a36440/nihms-439013-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/a96c14459dcb/nihms-439013-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/6107e2f775d3/nihms-439013-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/c5bc49ec6551/nihms-439013-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/d17909f8071b/nihms-439013-f0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/57d7c9e71ab9/nihms-439013-f0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/50524f252d9b/nihms-439013-f0014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/e2939f8ee48b/nihms-439013-f0015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/4d4bd76a6808/nihms-439013-f0016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/eb34d737ef3c/nihms-439013-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/81a477fa19a6/nihms-439013-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/d5c0bda56c07/nihms-439013-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/9b475b68262a/nihms-439013-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/4116f4e3ae13/nihms-439013-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/6e24e371c6d6/nihms-439013-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/89f3c9b3f787/nihms-439013-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/5a0139a36440/nihms-439013-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/a96c14459dcb/nihms-439013-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/6107e2f775d3/nihms-439013-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/c5bc49ec6551/nihms-439013-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/d17909f8071b/nihms-439013-f0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/57d7c9e71ab9/nihms-439013-f0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/50524f252d9b/nihms-439013-f0014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/e2939f8ee48b/nihms-439013-f0015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/438b/3727669/4d4bd76a6808/nihms-439013-f0016.jpg