• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

对加速下一代测序分析流程进行基准测试。

Benchmarking accelerated next-generation sequencing analysis pipelines.

作者信息

Samarakoon Pubudu Saneth, Fournous Ghislain, Hansen Lars T, Wijesiri Ashen, Zhao Sen, Alex A Rodriguez, Nandi Tarak Nath, Madduri Ravi, Rowe Alexander D, Thomassen Gard, Hovig Eivind, Razick Sabry

机构信息

Scientific Computing Services, Division for Research, Dissemination and Education, University of Oslo, Oslo, 0373, Norway.

Norwegian National Unit for Newborn Screening, Division for Pediatric and Adolescent Medicine, Oslo University Hospital, Oslo, 0450, Norway.

出版信息

Bioinform Adv. 2025 May 15;5(1):vbaf085. doi: 10.1093/bioadv/vbaf085. eCollection 2025.

DOI:10.1093/bioadv/vbaf085
PMID:40395501
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12092081/
Abstract

MOTIVATION

Industry-standard central processing unit (CPU)-based next-generation sequencing (NGS) analysis tools have led to longer runtimes, affecting their utility in time-sensitive clinical practices and population-scale research studies. To address this, researchers have developed accelerated NGS platforms like DRAGEN and Parabricks, which have significantly reduced runtimes-from days to hours. However, these studies have evaluated accelerated platforms independently without sufficiently assessing computational resource usage or thoroughly investigating speedup scalability, a gap our study is designed to address.

RESULTS

Corroborating previous studies, accelerated pipelines demonstrated shorter runtimes than CPU-only approaches, with Parabricks-H100 demonstrating the highest speedups, followed by DRAGEN. In mapping, DRAGEN outperformed Parabricks (L4 and A100) and matched H100 speedups. Parabricks (A100 and H100) variant calling demonstrated higher speedups than DRAGEN. Moreover, DRAGEN and Parabricks-H100 mapping showed positive trends in the coverage-based scalability analysis, while other configurations failed to scale effectively. Our profiler analysis provided new insights into the relationships between Parabricks' performances and resource usage patterns, revealing its potential for further improvements. Our findings and cost comparison help researchers select accelerated platforms based on coverage needs, timeframes, and budget, while suggesting optimization strategies.

AVAILABILITY AND IMPLEMENTATION

Datasets are described in the 'Data availability' section. Our NGS pipelines are available at https://github.com/NAICNO/accelerated_genomics.

摘要

动机

基于行业标准中央处理器(CPU)的下一代测序(NGS)分析工具导致运行时间延长,影响了它们在对时间敏感的临床实践和大规模人群研究中的效用。为了解决这个问题,研究人员开发了如DRAGEN和Parabricks等加速NGS平台,这些平台显著缩短了运行时间——从数天缩短至数小时。然而,这些研究都是独立评估加速平台,没有充分评估计算资源的使用情况,也没有深入研究加速的可扩展性,我们的研究旨在填补这一空白。

结果

与之前的研究一致,加速流程的运行时间比仅使用CPU的方法更短,其中Parabricks-H100的加速效果最佳,其次是DRAGEN。在比对方面,DRAGEN的表现优于Parabricks(L4和A100),且加速效果与H100相当。Parabricks(A100和H100)的变异检测加速效果高于DRAGEN。此外,DRAGEN和Parabricks-H100的比对在基于覆盖度的可扩展性分析中呈现出积极趋势,而其他配置未能有效扩展。我们的分析器分析为Parabricks的性能与资源使用模式之间的关系提供了新的见解,揭示了其进一步改进的潜力。我们的研究结果和成本比较有助于研究人员根据覆盖需求、时间框架和预算选择加速平台,同时提出优化策略。

可用性和实现方式

数据集在“数据可用性”部分进行了描述。我们的NGS流程可在https://github.com/NAICNO/accelerated_genomics获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90e2/12092081/81ac02e838e7/vbaf085f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90e2/12092081/33d423e31e3a/vbaf085f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90e2/12092081/2e1181787e6b/vbaf085f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90e2/12092081/b386db03b0c0/vbaf085f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90e2/12092081/0dfa81fcb451/vbaf085f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90e2/12092081/81ac02e838e7/vbaf085f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90e2/12092081/33d423e31e3a/vbaf085f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90e2/12092081/2e1181787e6b/vbaf085f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90e2/12092081/b386db03b0c0/vbaf085f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90e2/12092081/0dfa81fcb451/vbaf085f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90e2/12092081/81ac02e838e7/vbaf085f5.jpg

相似文献

1
Benchmarking accelerated next-generation sequencing analysis pipelines.对加速下一代测序分析流程进行基准测试。
Bioinform Adv. 2025 May 15;5(1):vbaf085. doi: 10.1093/bioadv/vbaf085. eCollection 2025.
2
Interventions for central serous chorioretinopathy: a network meta-analysis.中心性浆液性脉络膜视网膜病变的干预措施:一项网状Meta分析
Cochrane Database Syst Rev. 2025 Jun 16;6(6):CD011841. doi: 10.1002/14651858.CD011841.pub3.
3
Stigma Management Strategies of Autistic Social Media Users.自闭症社交媒体用户的污名管理策略
Autism Adulthood. 2025 May 28;7(3):273-282. doi: 10.1089/aut.2023.0095. eCollection 2025 Jun.
4
Adapting Safety Plans for Autistic Adults with Involvement from the Autism Community.在自闭症群体的参与下为成年自闭症患者调整安全计划。
Autism Adulthood. 2025 May 28;7(3):293-302. doi: 10.1089/aut.2023.0124. eCollection 2025 Jun.
5
Stakeholders' perceptions and experiences of factors influencing the commissioning, delivery, and uptake of general health checks: a qualitative evidence synthesis.利益相关者对影响一般健康检查的委托、提供和接受因素的看法与体验:一项定性证据综合分析
Cochrane Database Syst Rev. 2025 Mar 20;3(3):CD014796. doi: 10.1002/14651858.CD014796.pub2.
6
Molecular feature-based classification of retroperitoneal liposarcoma: a prospective cohort study.基于分子特征的腹膜后脂肪肉瘤分类:一项前瞻性队列研究。
Elife. 2025 May 23;14:RP100887. doi: 10.7554/eLife.100887.
7
Surveillance for Violent Deaths - National Violent Death Reporting System, 50 States, the District of Columbia, and Puerto Rico, 2022.暴力死亡监测——2022年全国暴力死亡报告系统,50个州、哥伦比亚特区和波多黎各
MMWR Surveill Summ. 2025 Jun 12;74(5):1-42. doi: 10.15585/mmwr.ss7405a1.
8
Assessing the comparative effects of interventions in COPD: a tutorial on network meta-analysis for clinicians.评估慢性阻塞性肺疾病干预措施的比较效果:面向临床医生的网状Meta分析教程
Respir Res. 2024 Dec 21;25(1):438. doi: 10.1186/s12931-024-03056-x.
9
Community views on mass drug administration for soil-transmitted helminths: a qualitative evidence synthesis.社区对土壤传播蠕虫群体药物给药的看法:定性证据综合分析
Cochrane Database Syst Rev. 2025 Jun 20;6:CD015794. doi: 10.1002/14651858.CD015794.pub2.
10
Wood Waste Valorization and Classification Approaches: A systematic review.木材废料的增值与分类方法:一项系统综述
Open Res Eur. 2025 May 6;5:5. doi: 10.12688/openreseurope.18862.1. eCollection 2025.

本文引用的文献

1
Genomes in clinical care.临床医疗中的基因组
NPJ Genom Med. 2024 Mar 14;9(1):20. doi: 10.1038/s41525-024-00402-2.
2
Evaluation of an optimized germline exomes pipeline using BWA-MEM2 and Dragen-GATK tools.使用 BWA-MEM2 和 Dragen-GATK 工具评估优化后的种系外显子组管道。
PLoS One. 2023 Aug 3;18(8):e0288371. doi: 10.1371/journal.pone.0288371. eCollection 2023.
3
Accelerating genomic workflows using NVIDIA Parabricks.利用 NVIDIA Parabricks 加速基因组工作流程。
BMC Bioinformatics. 2023 May 31;24(1):221. doi: 10.1186/s12859-023-05292-2.
4
Comparison of calling pipelines for whole genome sequencing: an empirical study demonstrating the importance of mapping and alignment.比较全基因组测序的调用管道:一项实证研究表明映射和比对的重要性。
Sci Rep. 2022 Dec 13;12(1):21502. doi: 10.1038/s41598-022-26181-3.
5
A GPU-accelerated compute framework for pathogen genomic variant identification to aid genomic epidemiology of infectious disease: a malaria case study.一个用于病原体基因组变异识别的 GPU 加速计算框架,以辅助传染病的基因组流行病学:以疟疾为例。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac314.
6
Accelerated identification of disease-causing variants with ultra-rapid nanopore genome sequencing.利用超快纳米孔基因组测序加速致病变异体的鉴定。
Nat Biotechnol. 2022 Jul;40(7):1035-1041. doi: 10.1038/s41587-022-01221-5. Epub 2022 Mar 28.
7
Effective variant filtering and expected candidate variant yield in studies of rare human disease.罕见人类疾病研究中的有效变异筛选及预期候选变异产出
NPJ Genom Med. 2021 Jul 15;6(1):60. doi: 10.1038/s41525-021-00227-3.
8
Twelve years of SAMtools and BCFtools.SAMtools 和 BCFtools 十二年。
Gigascience. 2021 Feb 16;10(2). doi: 10.1093/gigascience/giab008.
9
Accuracy and efficiency of germline variant calling pipelines for human genome data.人类基因组数据种系变异调用管道的准确性和效率。
Sci Rep. 2020 Nov 19;10(1):20222. doi: 10.1038/s41598-020-77218-4.
10
Best practices for variant calling in clinical sequencing.临床测序中变异调用的最佳实践。
Genome Med. 2020 Oct 26;12(1):91. doi: 10.1186/s13073-020-00791-w.