• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于多序列比对的基于嵌入式的图形处理单元集群平台

Embedded-Based Graphics Processing Unit Cluster Platform for Multiple Sequence Alignments.

作者信息

Wei Jyh-Da, Cheng Hui-Jun, Lin Chun-Yuan, Ye Jin, Yeh Kuan-Yu

机构信息

Department of Computer Science and Information Engineering, School of Electrical and Computer Engineering, College of Engineering, Chang Gung University, Taoyuan, Taiwan.

Department of Ophthalmology, Chang Gung Memorial Hospital, Keelung, Taiwan.

出版信息

Evol Bioinform Online. 2017 Aug 8;13:1176934317724764. doi: 10.1177/1176934317724764. eCollection 2017.

DOI:10.1177/1176934317724764
PMID:28835734
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5555494/
Abstract

High-end graphics processing units (GPUs), such as NVIDIA Tesla/Fermi/Kepler series cards with thousands of cores per chip, are widely applied to high-performance computing fields in a decade. These desktop GPU cards should be installed in personal computers/servers with desktop CPUs, and the cost and power consumption of constructing a GPU cluster platform are very high. In recent years, NVIDIA releases an embedded board, called Jetson Tegra K1 (TK1), which contains 4 ARM Cortex-A15 CPUs and 192 Compute Unified Device Architecture cores (belong to Kepler GPUs). Jetson Tegra K1 has several advantages, such as the low cost, low power consumption, and high applicability, and it has been applied into several specific applications. In our previous work, a bioinformatics platform with a single TK1 (STK platform) was constructed, and this previous work is also used to prove that the Web and mobile services can be implemented in the STK platform with a good cost-performance ratio by comparing a STK platform with the desktop CPU and GPU. In this work, an embedded-based GPU cluster platform will be constructed with multiple TK1s (MTK platform). Complex system installation and setup are necessary procedures at first. Then, 2 job assignment modes are designed for the MTK platform to provide services for users. Finally, ClustalW v2.0.11 and ClustalWtk will be ported to the MTK platform. The experimental results showed that the speedup ratios achieved 5.5 and 4.8 times for ClustalW v2.0.11 and ClustalWtk, respectively, by comparing 6 TK1s with a single TK1. The MTK platform is proven to be useful for multiple sequence alignments.

摘要

高端图形处理单元(GPU),如英伟达Tesla/Fermi/Kepler系列芯片,每个芯片拥有数千个核心,在过去十年中被广泛应用于高性能计算领域。这些桌面GPU卡应安装在配备桌面CPU的个人电脑/服务器中,构建GPU集群平台的成本和功耗非常高。近年来,英伟达发布了一款嵌入式开发板,名为Jetson Tegra K1(TK1),它包含4个ARM Cortex-A15 CPU和192个计算统一设备架构核心(属于Kepler GPU)。Jetson Tegra K1具有成本低、功耗低和适用性强等优点,已被应用于多个特定应用中。在我们之前的工作中,构建了一个基于单个TK1的生物信息学平台(STK平台),并且通过将STK平台与桌面CPU和GPU进行比较,这项先前的工作也用于证明可以在STK平台上以良好的性价比实现网络和移动服务。在这项工作中,将构建一个基于多个TK1的嵌入式GPU集群平台(MTK平台)。首先,复杂的系统安装和设置是必要的步骤。然后,为MTK平台设计了2种作业分配模式,以便为用户提供服务。最后,将ClustalW v2.0.11和ClustalWtk移植到MTK平台。实验结果表明,通过将6个TK1与单个TK1进行比较,ClustalW v2.0.11和ClustalWtk的加速比分别达到了5.5倍和4.8倍。MTK平台被证明对于多序列比对是有用的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1402/5555494/f0257faa8eb9/10.1177_1176934317724764-fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1402/5555494/7f29f306771b/10.1177_1176934317724764-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1402/5555494/6af3eb8a44be/10.1177_1176934317724764-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1402/5555494/c2b3367cb5a7/10.1177_1176934317724764-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1402/5555494/934eb4b0c9d8/10.1177_1176934317724764-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1402/5555494/c1daccd36754/10.1177_1176934317724764-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1402/5555494/a278b849813e/10.1177_1176934317724764-fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1402/5555494/f0257faa8eb9/10.1177_1176934317724764-fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1402/5555494/7f29f306771b/10.1177_1176934317724764-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1402/5555494/6af3eb8a44be/10.1177_1176934317724764-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1402/5555494/c2b3367cb5a7/10.1177_1176934317724764-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1402/5555494/934eb4b0c9d8/10.1177_1176934317724764-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1402/5555494/c1daccd36754/10.1177_1176934317724764-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1402/5555494/a278b849813e/10.1177_1176934317724764-fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1402/5555494/f0257faa8eb9/10.1177_1176934317724764-fig7.jpg

相似文献

1
Embedded-Based Graphics Processing Unit Cluster Platform for Multiple Sequence Alignments.用于多序列比对的基于嵌入式的图形处理单元集群平台
Evol Bioinform Online. 2017 Aug 8;13:1176934317724764. doi: 10.1177/1176934317724764. eCollection 2017.
2
A real-time spike sorting method based on the embedded GPU.一种基于嵌入式图形处理器的实时尖峰排序方法。
Annu Int Conf IEEE Eng Med Biol Soc. 2017 Jul;2017:1010-1013. doi: 10.1109/EMBC.2017.8036997.
3
ReStAC-UAV-Borne Real-Time SGM Stereo Optimized for Embedded ARM and CUDA Devices.ReStAC - 针对嵌入式ARM和CUDA设备优化的无人机搭载实时SGM立体视觉
Sensors (Basel). 2021 Jun 7;21(11):3938. doi: 10.3390/s21113938.
4
NMF-mGPU: non-negative matrix factorization on multi-GPU systems.NMF-mGPU:多GPU系统上的非负矩阵分解
BMC Bioinformatics. 2015 Feb 13;16:43. doi: 10.1186/s12859-015-0485-4.
5
Efficient methods for implementation of multi-level nonrigid mass-preserving image registration on GPUs and multi-threaded CPUs.在图形处理器(GPU)和多线程中央处理器(CPU)上实现多级非刚性质量守恒图像配准的高效方法。
Comput Methods Programs Biomed. 2016 Apr;127:290-300. doi: 10.1016/j.cmpb.2015.12.018. Epub 2016 Jan 6.
6
Compute-unified device architecture implementation of a block-matching algorithm for multiple graphical processing unit cards.用于多图形处理单元卡的块匹配算法的计算统一设备架构实现
J Electron Imaging. 2011 Jul;20(3). doi: 10.1117/1.3606588.
7
Accelerating Pythonic Coupled-Cluster Implementations: A Comparison Between CPUs and GPUs.加速Python风格的耦合簇实现:CPU与GPU的比较
J Chem Theory Comput. 2024 Feb 13;20(3):1130-1142. doi: 10.1021/acs.jctc.3c01110. Epub 2024 Feb 2.
8
CUDA ClustalW: An efficient parallel algorithm for progressive multiple sequence alignment on Multi-GPUs.CUDA ClustalW:一种用于在多图形处理器上进行渐进式多序列比对的高效并行算法。
Comput Biol Chem. 2015 Oct;58:62-8. doi: 10.1016/j.compbiolchem.2015.05.004. Epub 2015 May 21.
9
Exploiting graphics processing units for computational biology and bioinformatics.利用图形处理单元进行计算生物学和生物信息学。
Interdiscip Sci. 2010 Sep;2(3):213-20. doi: 10.1007/s12539-010-0002-4. Epub 2010 Jul 25.
10
A comparative analysis of near-infrared image colorization methods for low-power NVIDIA Jetson embedded systems.针对低功耗英伟达Jetson嵌入式系统的近红外图像上色方法的比较分析。
Front Neurorobot. 2023 Apr 24;17:1143032. doi: 10.3389/fnbot.2023.1143032. eCollection 2023.

引用本文的文献

1
First Report of Green Mold Disease Caused by on .由……引起的绿霉病的首次报告 于……上。 你提供的原文信息不完整,请补充完整以便我能准确翻译。
Mycobiology. 2020 Jul 24;48(5):427-430. doi: 10.1080/12298093.2020.1794230.
2
PhotoId-Whale: Blue whale dorsal fin classification for mobile devices.照片 ID-鲸鱼:移动设备上的蓝鲸鱼背鳍分类。
PLoS One. 2020 Oct 12;15(10):e0237570. doi: 10.1371/journal.pone.0237570. eCollection 2020.
3
Phylogenetic Analyses of Some Species (Agaricales, Tricholomataceae) in Northern China, With Descriptions of Two New Species and the Identification of Seven Species as a First Record.

本文引用的文献

1
CUDA ClustalW: An efficient parallel algorithm for progressive multiple sequence alignment on Multi-GPUs.CUDA ClustalW:一种用于在多图形处理器上进行渐进式多序列比对的高效并行算法。
Comput Biol Chem. 2015 Oct;58:62-8. doi: 10.1016/j.compbiolchem.2015.05.004. Epub 2015 May 21.
2
CUDA-BLASTP: accelerating BLASTP on CUDA-enabled graphics hardware.CUDA-BLASTP:在支持 CUDA 的图形硬件上加速 BLASTP。
IEEE/ACM Trans Comput Biol Bioinform. 2011 Nov-Dec;8(6):1678-84. doi: 10.1109/TCBB.2011.33.
3
CUDASW++: optimizing Smith-Waterman sequence database searches for CUDA-enabled graphics processing units.
中国北方部分伞菌目白蘑科物种的系统发育分析,兼两个新物种的描述及七个物种的首次记录鉴定
Front Microbiol. 2019 Sep 26;10:2167. doi: 10.3389/fmicb.2019.02167. eCollection 2019.
CUDASW++:针对支持CUDA的图形处理单元优化史密斯-沃特曼序列数据库搜索
BMC Res Notes. 2009 May 6;2:73. doi: 10.1186/1756-0500-2-73.
4
High-throughput sequence alignment using Graphics Processing Units.使用图形处理单元进行高通量序列比对。
BMC Bioinformatics. 2007 Dec 10;8:474. doi: 10.1186/1471-2105-8-474.
5
ClustalW-MPI: ClustalW analysis using distributed and parallel computing.ClustalW-MPI:使用分布式和并行计算的ClustalW分析。
Bioinformatics. 2003 Aug 12;19(12):1585-6. doi: 10.1093/bioinformatics/btg192.
6
CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.CLUSTAL W:通过序列加权、位置特异性空位罚分和权重矩阵选择提高渐进多序列比对的灵敏度。
Nucleic Acids Res. 1994 Nov 11;22(22):4673-80. doi: 10.1093/nar/22.22.4673.
7
A general method applicable to the search for similarities in the amino acid sequence of two proteins.一种适用于寻找两种蛋白质氨基酸序列相似性的通用方法。
J Mol Biol. 1970 Mar;48(3):443-53. doi: 10.1016/0022-2836(70)90057-4.