Performance evaluation method for read mapping tool in clinical panel sequencing.

Authors

Lee Hojun, Lee Ki-Wook, Lee Taeseob, Park Donghyun, Chung Jongsuk, Lee Chung, Park Woong-Yang, Son Dae-Soon

Affiliations

1. Samsung Genome Institute (SGI), Samsung Medical Center (SMC), Seoul, 06351 South Korea.

2. Department of Digital Health, SAIHST, Sungkyunkwan University, Seoul, 06351 South Korea.

Publication information

Genes Genomics. 2018;40(2):189-197. doi: 10.1007/s13258-017-0621-9. Epub 2017 Nov 9.

DOI: 10.1007/s13258-017-0621-9
PMID: 29568413
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5846869/
Abstract

With the rapid advancement of next-generation sequencing (NGS) technology, clinical panel sequencing is increasingly used in clinical studies and tests. However, the tools used in NGS data analysis have not been comparatively evaluated for their performance on panel sequencing. This study aimed to evaluate the tools used for alignment, the first step in bioinformatics analysis, by comparing widely used tools with recently introduced ones. Variants detected in accumulated panel sequencing data were cataloged and inserted into simulated reads generated from the reference genome (hg19). The numbers of unmapped and misaligned reads, the mapping quality distribution, and the runtime were measured as comparison criteria. The most widely used tools, Bowtie2 and BWA-MEM, both performed strongly, with AUCs of 0.9984 and 0.9970, respectively. Kart, which had a superior runtime and fewer misaligned reads, also achieved a high AUC (0.9723). This method for selecting and optimizing tools appropriate for panel sequencing can be applied in fields that require error minimization, such as clinical applications and liquid biopsy studies.
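The evaluation described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names, the variant format (a dict of 0-based position to alternate base), the exact-position correctness rule, and the reading of AUC as the ROC AUC of mapping quality for separating correct from incorrect alignments are all assumptions made for illustration. The abstract does not state how the AUC was computed.

```python
import random


def simulate_reads(reference, variants, read_len=50, n_reads=200, seed=0):
    """Cut fixed-length reads from a reference after inserting SNVs.

    variants: {0-based position: alternate base} (hypothetical format).
    Returns a list of (true_start, read_sequence) tuples.
    """
    rng = random.Random(seed)
    mutated = list(reference)
    for pos, alt in variants.items():
        mutated[pos] = alt
    mutated = "".join(mutated)
    reads = []
    for _ in range(n_reads):
        start = rng.randrange(len(mutated) - read_len + 1)
        reads.append((start, mutated[start:start + read_len]))
    return reads


def summarize(truth, alignments, tolerance=0):
    """Count unmapped and misaligned reads.

    truth: {read_id: true_start}
    alignments: {read_id: (reported_start or None, mapq)}
    """
    unmapped = sum(1 for pos, _ in alignments.values() if pos is None)
    misaligned = sum(
        1
        for rid, (pos, _) in alignments.items()
        if pos is not None and abs(pos - truth[rid]) > tolerance
    )
    return unmapped, misaligned


def mapq_auc(truth, alignments, tolerance=0):
    """ROC AUC of MAPQ as a score for 'alignment is correct'.

    Uses the pairwise (Mann-Whitney) formulation; ties count 0.5.
    Unmapped reads are skipped (an assumption of this sketch).
    Returns None when one of the two classes is empty.
    """
    correct, wrong = [], []
    for rid, (pos, mapq) in alignments.items():
        if pos is None:
            continue
        bucket = correct if abs(pos - truth[rid]) <= tolerance else wrong
        bucket.append(mapq)
    if not correct or not wrong:
        return None
    wins = sum((c > w) + 0.5 * (c == w) for c in correct for w in wrong)
    return wins / (len(correct) * len(wrong))


# Toy aligner output: r3 is misplaced with low MAPQ, r4 is unmapped.
truth = {"r1": 10, "r2": 40, "r3": 75, "r4": 120}
alignments = {"r1": (10, 60), "r2": (40, 42), "r3": (300, 3), "r4": (None, 0)}
print(summarize(truth, alignments))  # (unmapped, misaligned)
print(mapq_auc(truth, alignments))
```

In practice the `alignments` mapping would be filled from each aligner's SAM/BAM output, and the same truth set would be reused across aligners so that the counts and AUC values are comparable.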


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45dd/5846869/5f5f4baf66b3/13258_2017_621_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45dd/5846869/e38007774158/13258_2017_621_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45dd/5846869/a1d5d6defad8/13258_2017_621_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45dd/5846869/8f48e56fa038/13258_2017_621_Fig4_HTML.jpg
