• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

BAMClipper:从比对中移除引物,以最大限度减少扩增子下一代测序中的假阴性突变。

BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing.

机构信息

Division of Molecular Pathology, Department of Pathology, Hong Kong Sanatorium & Hospital, Happy Valley, Hong Kong SAR, China.

Department of Surgery, The University of Hong Kong, Happy Valley, Hong Kong SAR, China.

出版信息

Sci Rep. 2017 May 8;7(1):1567. doi: 10.1038/s41598-017-01703-6.

DOI:10.1038/s41598-017-01703-6
PMID:28484262
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5431517/
Abstract

Amplicon-based next-generation sequencing (NGS) has been widely adopted for genetic variation detection in human and other organisms. Conventional data analysis paradigm includes primer trimming before read mapping. Here we introduce BAMClipper that removes primer sequences after mapping original sequencing reads by soft-clipping SAM/BAM alignments. Mutation detection accuracy was affected by the choice of primer handling approach based on real NGS datasets of 7 human peripheral blood or breast cancer tissue samples with known BRCA1/BRCA2 mutations and >130000 simulated NGS datasets with unique mutations. BAMClipper approach detected a BRCA1 deletion (c.1620_1636del) that was otherwise missed due to edge effect. Simulation showed high false-negative rate when primers were perfectly trimmed as in conventional practice. Among the other 6 samples, variant allele frequencies of 5 BRCA1/BRCA2 mutations (indel or single-nucleotide variants) were diluted by apparently wild-type primer sequences from an overlapping amplicon (17 to 82% under-estimation). BAMClipper was robust in both situations and all 7 mutations were detected. When compared with Cutadapt, BAMClipper was faster and maintained equally high primer removal effectiveness. BAMClipper is implemented in Perl and is available under an open source MIT license at https://github.com/tommyau/bamclipper.

摘要

基于扩增子的新一代测序(NGS)已被广泛应用于人类和其他生物的遗传变异检测。传统数据分析方法包括在读取映射前进行引物修剪。本文介绍了 BAMClipper,它通过软剪辑 SAM/BAM 比对,在映射原始测序读取后去除引物序列。基于已知 BRCA1/BRCA2 突变的 7 个人外周血或乳腺癌组织样本和 >130000 个具有独特突变的模拟 NGS 数据集的真实 NGS 数据集,研究了引物处理方法的选择对突变检测准确性的影响。BAMClipper 方法检测到了 BRCA1 缺失(c.1620_1636del),否则由于边缘效应而被遗漏。模拟显示,当引物如传统方法中那样完美修剪时,假阴性率很高。在其他 6 个样本中,5 个 BRCA1/BRCA2 突变(插入或单核苷酸变异)的变异等位基因频率被来自重叠扩增子的明显野生型引物序列稀释(低估 17%至 82%)。BAMClipper 在这两种情况下都很稳健,所有 7 个突变都被检测到。与 Cutadapt 相比,BAMClipper 速度更快,并且保持了相同的高引物去除效率。BAMClipper 是用 Perl 实现的,并在 https://github.com/tommyau/bamclipper 上以开源 MIT 许可证提供。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec9a/5431517/da6a6241d81f/41598_2017_1703_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec9a/5431517/295b0e6ca79f/41598_2017_1703_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec9a/5431517/06d4673311d3/41598_2017_1703_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec9a/5431517/ec139bdcd584/41598_2017_1703_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec9a/5431517/da6a6241d81f/41598_2017_1703_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec9a/5431517/295b0e6ca79f/41598_2017_1703_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec9a/5431517/06d4673311d3/41598_2017_1703_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec9a/5431517/ec139bdcd584/41598_2017_1703_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec9a/5431517/da6a6241d81f/41598_2017_1703_Fig4_HTML.jpg

相似文献

1
BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing.BAMClipper:从比对中移除引物,以最大限度减少扩增子下一代测序中的假阴性突变。
Sci Rep. 2017 May 8;7(1):1567. doi: 10.1038/s41598-017-01703-6.
2
BRCA-analyzer: Automatic workflow for processing NGS reads of BRCA1 and BRCA2 genes.BRCA-analyzer:BRCA1 和 BRCA2 基因 NGS 读取的自动处理工作流程。
Comput Biol Chem. 2018 Dec;77:297-306. doi: 10.1016/j.compbiolchem.2018.10.012. Epub 2018 Oct 23.
3
cutPrimers: A New Tool for Accurate Cutting of Primers from Reads of Targeted Next Generation Sequencing.cutPrimers:一种从靶向新一代测序读段中准确切割引物的新工具。
J Comput Biol. 2017 Nov;24(11):1138-1143. doi: 10.1089/cmb.2017.0096. Epub 2017 Jul 17.
4
Evaluation of an amplicon-based next-generation sequencing panel for detection of BRCA1 and BRCA2 genetic variants.基于扩增子的新一代测序检测板用于检测BRCA1和BRCA2基因变异的评估。
Breast Cancer Res Treat. 2016 Aug;158(3):433-40. doi: 10.1007/s10549-016-3891-z. Epub 2016 Jul 6.
5
Edge effects in calling variants from targeted amplicon sequencing.靶向扩增子测序中变异检测的边缘效应
BMC Genomics. 2014 Dec 5;15(1):1073. doi: 10.1186/1471-2164-15-1073.
6
Analysis of amplicon-based NGS data from neurological disease gene panels: a new method for allele drop-out management.基于扩增子的神经系统疾病基因panel的二代测序数据分析:一种新的等位基因脱扣管理方法
BMC Bioinformatics. 2016 Nov 8;17(Suppl 12):339. doi: 10.1186/s12859-016-1189-0.
7
INDELseek: detection of complex insertions and deletions from next-generation sequencing data.INDELseek:从下一代测序数据中检测复杂插入和缺失
BMC Genomics. 2017 Jan 5;18(1):16. doi: 10.1186/s12864-016-3449-9.
8
AlignerBoost: A Generalized Software Toolkit for Boosting Next-Gen Sequencing Mapping Accuracy Using a Bayesian-Based Mapping Quality Framework.AlignerBoost:一种基于贝叶斯映射质量框架提高下一代测序映射准确性的通用软件工具包。
PLoS Comput Biol. 2016 Oct 5;12(10):e1005096. doi: 10.1371/journal.pcbi.1005096. eCollection 2016 Oct.
9
Ultrasensitive single-genome sequencing: accurate, targeted, next generation sequencing of HIV-1 RNA.超灵敏单基因组测序:HIV-1 RNA的准确、靶向新一代测序
Retrovirology. 2016 Dec 20;13(1):87. doi: 10.1186/s12977-016-0321-6.
10
Detection of Germline Mutation in Hereditary Breast and/or Ovarian Cancers by Next-Generation Sequencing on a Four-Gene Panel.利用四基因检测板通过二代测序检测遗传性乳腺癌和/或卵巢癌中的胚系突变
J Mol Diagn. 2016 Jul;18(4):580-94. doi: 10.1016/j.jmoldx.2016.03.005. Epub 2016 May 5.

引用本文的文献

1
Using minor variant genomes and machine learning to study the genome biology of SARS-CoV-2 over time.利用次要变异基因组和机器学习研究新冠病毒随时间变化的基因组生物学。
Nucleic Acids Res. 2025 Feb 8;53(4). doi: 10.1093/nar/gkaf077.
2
Novel microsatellite instability test of sebaceous tumours to facilitate low-cost universal screening for Lynch syndrome.皮脂腺肿瘤的新型微卫星不稳定性检测,以促进对林奇综合征的低成本普遍筛查。
Clin Exp Dermatol. 2025 May 23;50(6):1155-1162. doi: 10.1093/ced/llaf005.
3
Whole-genome sequencing surveillance of Siberian tick-borne encephalitis virus (TBEV) identifies an additional lineage in Kyrgyzstan.

本文引用的文献

1
Homoharringtonine (omacetaxine mepesuccinate) as an adjunct for FLT3-ITD acute myeloid leukemia.奥马环素(依维莫司)作为 FLT3-ITD 急性髓系白血病的辅助药物。
Sci Transl Med. 2016 Oct 5;8(359):359ra129. doi: 10.1126/scitranslmed.aaf3735.
2
Clinical Effects of Driver Somatic Mutations on the Outcomes of Patients With Myelodysplastic Syndromes Treated With Allogeneic Hematopoietic Stem-Cell Transplantation.驱动体细胞突变对接受异基因造血干细胞移植治疗的骨髓增生异常综合征患者预后的临床影响。
J Clin Oncol. 2016 Oct 20;34(30):3627-3637. doi: 10.1200/JCO.2016.67.3616.
3
Next-generation sequencing with a myeloid gene panel in core-binding factor AML showed KIT activation loop and TET2 mutations predictive of outcome.
对西伯利亚蜱传脑炎病毒(TBEV)进行的全基因组测序监测在吉尔吉斯斯坦发现了一个新的谱系。
Virus Res. 2025 Jan;351:199517. doi: 10.1016/j.virusres.2024.199517. Epub 2024 Dec 22.
4
Development of targeted whole genome sequencing approaches for Crimean-Congo haemorrhagic fever virus (CCHFV).针对克里米亚-刚果出血热病毒(CCHFV)的靶向全基因组测序方法的开发。
Virus Res. 2024 Dec;350:199464. doi: 10.1016/j.virusres.2024.199464. Epub 2024 Sep 20.
5
Rare variation in non-coding regions with evolutionary signatures contributes to autism spectrum disorder risk.具有进化特征的非编码区域中的罕见变异导致自闭症谱系障碍风险增加。
Cell Genom. 2024 Aug 14;4(8):100609. doi: 10.1016/j.xgen.2024.100609. Epub 2024 Jul 16.
6
SARS-CoV-2 population dynamics in immunocompetent individuals in a closed transmission chain shows genomic diversity over the course of infection.在封闭传播链中免疫功能正常个体的新冠病毒种群动态显示,在感染过程中存在基因组多样性。
Genome Med. 2024 Jul 16;16(1):89. doi: 10.1186/s13073-024-01360-1.
7
UnCoVar: a reproducible and scalable workflow for transparent and robust virus variant calling and lineage assignment using SARS-CoV-2 as an example.UnCoVar:一个可重现和可扩展的工作流程,用于使用 SARS-CoV-2 作为示例进行透明且稳健的病毒变异体检测和谱系分配。
BMC Genomics. 2024 Jun 28;25(1):647. doi: 10.1186/s12864-024-10539-0.
8
Lessons learned: overcoming common challenges in reconstructing the SARS-CoV-2 genome from short-read sequencing data via CoVpipe2.经验教训:通过CoVpipe2从短读长测序数据重建严重急性呼吸综合征冠状病毒2(SARS-CoV-2)基因组时克服常见挑战。
F1000Res. 2024 Apr 16;12:1091. doi: 10.12688/f1000research.136683.1. eCollection 2023.
9
Cell lineage analysis with somatic mutations reveals late divergence of neuronal cell types and cortical areas in human cerebral cortex.利用体细胞突变进行的细胞谱系分析揭示了人类大脑皮质中神经元细胞类型和皮质区域的晚期分化。
bioRxiv. 2023 Nov 6:2023.11.06.565899. doi: 10.1101/2023.11.06.565899.
10
The P323L substitution in the SARS-CoV-2 polymerase (NSP12) confers a selective advantage during infection.SARS-CoV-2 聚合酶(NSP12)中的 P323L 取代在感染过程中赋予了选择性优势。
Genome Biol. 2023 Mar 13;24(1):47. doi: 10.1186/s13059-023-02881-5.
在核心结合因子急性髓系白血病中使用髓系基因panel进行的下一代测序显示,KIT激活环和TET2突变可预测预后。
Blood Cancer J. 2016 Jul 8;6(7):e442. doi: 10.1038/bcj.2016.51.
4
Sequence Variant Descriptions: HGVS Nomenclature and Mutalyzer.序列变异描述:HGVS命名法与Mutalyzer
Curr Protoc Hum Genet. 2016 Jul 1;90:7.13.1-7.13.19. doi: 10.1002/cphg.2.
5
Detection of Germline Mutation in Hereditary Breast and/or Ovarian Cancers by Next-Generation Sequencing on a Four-Gene Panel.利用四基因检测板通过二代测序检测遗传性乳腺癌和/或卵巢癌中的胚系突变
J Mol Diagn. 2016 Jul;18(4):580-94. doi: 10.1016/j.jmoldx.2016.03.005. Epub 2016 May 5.
6
Clinical evaluation of panel testing by next-generation sequencing (NGS) for gene mutations in myeloid neoplasms.采用新一代测序(NGS)进行髓系肿瘤基因突变的panel检测的临床评估。
Diagn Pathol. 2016 Jan 22;11:11. doi: 10.1186/s13000-016-0456-8.
7
Panel Testing for Familial Breast Cancer: Calibrating the Tension Between Research and Clinical Care.面板测试家族性乳腺癌:研究与临床护理之间的紧张关系的校准。
J Clin Oncol. 2016 May 1;34(13):1455-9. doi: 10.1200/JCO.2015.63.7454. Epub 2016 Jan 19.
8
Advances in clinical next-generation sequencing: target enrichment and sequencing technologies.临床新一代测序技术的进展:靶向富集和测序技术
Expert Rev Mol Diagn. 2016;16(3):357-72. doi: 10.1586/14737159.2016.1133298. Epub 2016 Jan 18.
9
Next-Generation Sequencing of Pulmonary Sarcomatoid Carcinoma Reveals High Frequency of Actionable MET Gene Mutations.肺肉瘤样癌的下一代测序揭示了 MET 基因高频激活突变。
J Clin Oncol. 2016 Mar 10;34(8):794-802. doi: 10.1200/JCO.2015.62.0674. Epub 2015 Jul 27.
10
Edge effects in calling variants from targeted amplicon sequencing.靶向扩增子测序中变异检测的边缘效应
BMC Genomics. 2014 Dec 5;15(1):1073. doi: 10.1186/1471-2164-15-1073.