• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基因组序列处理的并行计算。

Parallel computing for genome sequence processing.

机构信息

Hunan Provincial Key Lab of Bioinformatics, School of Computer Science and Engineering at Central South University, Changsha, China.

computer science at Old Dominion University, USA.

出版信息

Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab070.

DOI:10.1093/bib/bbab070
PMID:33822883
Abstract

The rapid increase of genome data brought by gene sequencing technologies poses a massive challenge to data processing. To solve the problems caused by enormous data and complex computing requirements, researchers have proposed many methods and tools which can be divided into three types: big data storage, efficient algorithm design and parallel computing. The purpose of this review is to investigate popular parallel programming technologies for genome sequence processing. Three common parallel computing models are introduced according to their hardware architectures, and each of which is classified into two or three types and is further analyzed with their features. Then, the parallel computing for genome sequence processing is discussed with four common applications: genome sequence alignment, single nucleotide polymorphism calling, genome sequence preprocessing, and pattern detection and searching. For each kind of application, its background is firstly introduced, and then a list of tools or algorithms are summarized in the aspects of principle, hardware platform and computing efficiency. The programming model of each hardware and application provides a reference for researchers to choose high-performance computing tools. Finally, we discuss the limitations and future trends of parallel computing technologies.

摘要

测序技术产生的基因组数据的快速增长给数据处理带来了巨大的挑战。为了解决由大量数据和复杂计算要求引起的问题,研究人员提出了许多方法和工具,这些方法和工具可以分为三种类型:大数据存储、高效算法设计和并行计算。本综述旨在研究基因组序列处理中流行的并行编程技术。根据硬件架构介绍了三种常见的并行计算模型,每个模型分为两种或三种类型,并进一步分析了其特点。然后,通过四个常见的应用程序:基因组序列比对、单核苷酸多态性调用、基因组序列预处理和模式检测与搜索,讨论了基因组序列处理的并行计算。对于每种应用,首先介绍其背景,然后从原理、硬件平台和计算效率等方面总结了工具或算法列表。每种硬件和应用的编程模型为研究人员选择高性能计算工具提供了参考。最后,我们讨论了并行计算技术的局限性和未来趋势。

相似文献

1
Parallel computing for genome sequence processing.基因组序列处理的并行计算。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab070.
2
A hybrid short read mapping accelerator.一种混合短读映射加速器。
BMC Bioinformatics. 2013 Feb 26;14:67. doi: 10.1186/1471-2105-14-67.
3
Heterogeneous Cloud Framework for Big Data Genome Sequencing.用于大数据基因组测序的异构云框架
IEEE/ACM Trans Comput Biol Bioinform. 2015 Jan-Feb;12(1):166-78. doi: 10.1109/TCBB.2014.2351800.
4
Practical guide for managing large-scale human genome data in research.研究中管理大规模人类基因组数据的实用指南。
J Hum Genet. 2021 Jan;66(1):39-52. doi: 10.1038/s10038-020-00862-1. Epub 2020 Oct 23.
5
Faster single-end alignment generation utilizing multi-thread for BWA.利用多线程实现更快的BWA单端比对生成。
Biomed Mater Eng. 2015;26 Suppl 1:S1791-6. doi: 10.3233/BME-151480.
6
Group-based variant calling leveraging next-generation supercomputing for large-scale whole-genome sequencing studies.基于群体的变异检测,利用下一代超级计算技术进行大规模全基因组测序研究。
BMC Bioinformatics. 2015 Sep 22;16(1):304. doi: 10.1186/s12859-015-0736-4.
7
Accurate estimation of short read mapping quality for next-generation genome sequencing.准确估计下一代基因组测序中短读测序数据的映射质量。
Bioinformatics. 2012 Sep 15;28(18):i349-i355. doi: 10.1093/bioinformatics/bts408.
8
MinimapR: A parallel alignment tool for the analysis of large-scale third-generation sequencing data.MinimapR:一种用于分析大规模第三代测序数据的并行比对工具。
Comput Biol Chem. 2022 Aug;99:107735. doi: 10.1016/j.compbiolchem.2022.107735. Epub 2022 Jul 13.
9
Rapid protein alignment in the cloud: HAMOND combines fast DIAMOND alignments with Hadoop parallelism.快速的云端蛋白质序列比对:HAMOND 将快速的 DIAMOND 比对与 Hadoop 并行处理相结合。
J Biotechnol. 2017 Sep 10;257:58-60. doi: 10.1016/j.jbiotec.2017.02.020. Epub 2017 Feb 21.
10
A fast adaptive algorithm for computing whole-genome homology maps.一种用于计算全基因组同源图谱的快速自适应算法。
Bioinformatics. 2018 Sep 1;34(17):i748-i756. doi: 10.1093/bioinformatics/bty597.

引用本文的文献

1
Sequencing technologies and hardware-accelerated parallel computing transform computational genomics research.测序技术和硬件加速的并行计算改变了计算基因组学研究。
Front Bioinform. 2024 Mar 19;4:1384497. doi: 10.3389/fbinf.2024.1384497. eCollection 2024.
2
SynBioTools: a one-stop facility for searching and selecting synthetic biology tools.SynBioTools:一站式搜索和选择合成生物学工具的平台。
BMC Bioinformatics. 2023 Apr 17;24(1):152. doi: 10.1186/s12859-023-05281-5.
3
A Review of Parallel Implementations for the Smith-Waterman Algorithm.
《Smith-Waterman 算法的并行实现综述》。
Interdiscip Sci. 2022 Mar;14(1):1-14. doi: 10.1007/s12539-021-00473-0. Epub 2021 Sep 6.