• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

片段化酶诱导的双链假象的表征与缓解

Characterization and mitigation of fragmentation enzyme-induced dual stranded artifacts.

作者信息

Gregory Thomas, Ngankeu Apollinaire, Orwick Shelley, Kautto Esko A, Woyach Jennifer A, Byrd John C, Blachly James S

机构信息

Division of Hematology, Ohio State University, Columbus, OH 43210, USA.

出版信息

NAR Genom Bioinform. 2020 Dec;2(4):lqaa070. doi: 10.1093/nargab/lqaa070. Epub 2020 Oct 2.

DOI:10.1093/nargab/lqaa070
PMID:33043294
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7531576/
Abstract

High-throughput short-read sequencing relies on fragmented DNA for optimal sampling of input nucleic acid. Several vendors now offer proprietary enzyme cocktails as a cheaper and more streamlined method of fragmentation when compared to acoustic shearing. We have discovered that these enzymes induce the formation of library molecules containing regions of nearby DNA from opposite strands. Sequencing reads derived from these molecules can lead to artifact-derived variant calls appearing at variant allele frequencies <5%. We present Fragmentation Artifact Detection and Elimination (FADE), software to remove these artifacts from mapped reads and mitigate artifact-related effects on downstream analysis. We find that the artifacts principally affect downstream analyses that are sensitive to a 1-3% artifact bias in the sequencing reads, such as targeted resequencing and rare variant discovery.

摘要

高通量短读长测序依赖于片段化的DNA以实现对输入核酸的最佳采样。与声学剪切相比,现在有几家供应商提供专利酶混合物,作为一种更便宜、更简化的片段化方法。我们发现,这些酶会诱导形成包含来自相反链的附近DNA区域的文库分子。来自这些分子的测序读数可能导致在变异等位基因频率<5%时出现源自假象的变异调用。我们提出了片段化假象检测与消除(FADE)软件,用于从比对读数中去除这些假象,并减轻假象对下游分析的相关影响。我们发现,这些假象主要影响对测序读数中1-3%的假象偏差敏感的下游分析,如靶向重测序和罕见变异发现。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4705/7671401/7256dafcd82c/lqaa070fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4705/7671401/c34bbe75aee7/lqaa070fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4705/7671401/f1defbbbb2a5/lqaa070fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4705/7671401/7256dafcd82c/lqaa070fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4705/7671401/c34bbe75aee7/lqaa070fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4705/7671401/f1defbbbb2a5/lqaa070fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4705/7671401/7256dafcd82c/lqaa070fig3.jpg

相似文献

1
Characterization and mitigation of fragmentation enzyme-induced dual stranded artifacts.片段化酶诱导的双链假象的表征与缓解
NAR Genom Bioinform. 2020 Dec;2(4):lqaa070. doi: 10.1093/nargab/lqaa070. Epub 2020 Oct 2.
2
Characterization and mitigation of artifacts derived from NGS library preparation due to structure-specific sequences in the human genome.由于人类基因组中具有结构特异性的序列,对源自 NGS 文库制备的伪影进行特征描述和缓解。
BMC Genomics. 2024 Mar 1;25(1):227. doi: 10.1186/s12864-024-10157-w.
3
Sources of erroneous sequences and artifact chimeric reads in next generation sequencing of genomic DNA from formalin-fixed paraffin-embedded samples.从福尔马林固定石蜡包埋样本的基因组 DNA 进行下一代测序中错误序列和人工嵌合读取的来源。
Nucleic Acids Res. 2019 Jan 25;47(2):e12. doi: 10.1093/nar/gky1142.
4
Identifying, understanding, and correcting technical artifacts on the sex chromosomes in next-generation sequencing data.鉴定、理解和纠正下一代测序数据中性染色体上的技术伪影。
Gigascience. 2019 Jul 1;8(7). doi: 10.1093/gigascience/giz074.
5
A simple method for semi-random DNA amplicon fragmentation using the methylation-dependent restriction enzyme MspJI.一种使用甲基化依赖性限制酶MspJI进行半随机DNA扩增子片段化的简单方法。
BMC Biotechnol. 2015 Apr 11;15:25. doi: 10.1186/s12896-015-0139-7.
6
Characterization of background noise in capture-based targeted sequencing data.基于捕获的靶向测序数据中背景噪声的特征分析
Genome Biol. 2017 Jul 21;18(1):136. doi: 10.1186/s13059-017-1275-2.
7
Sequencing artifacts derived from a library preparation method using enzymatic fragmentation.测序伪影源自使用酶切片段化的文库制备方法。
PLoS One. 2020 Jan 3;15(1):e0227427. doi: 10.1371/journal.pone.0227427. eCollection 2020.
8
Next generation sequencing of STR artifacts produced from historical bone samples.下一代 STR 序列分析技术在历史骨骼样本 STR 痕迹中的应用。
Forensic Sci Int Genet. 2020 Nov;49:102397. doi: 10.1016/j.fsigen.2020.102397. Epub 2020 Sep 28.
9
UNDR ROVER - a fast and accurate variant caller for targeted DNA sequencing.UNDR ROVER——一种用于靶向DNA测序的快速且准确的变异检测工具。
BMC Bioinformatics. 2016 Apr 16;17:165. doi: 10.1186/s12859-016-1014-9.
10
RepLong: de novo repeat identification using long read sequencing data.RepLong:利用长读测序数据进行从头重复识别。
Bioinformatics. 2018 Apr 1;34(7):1099-1107. doi: 10.1093/bioinformatics/btx717.

引用本文的文献

1
Replication stress induces POLQ-mediated structural variant formation throughout common fragile sites after entry into mitosis.复制压力诱导 POLQ 介导的结构变异形成,在有丝分裂进入后贯穿常见脆弱部位。
Nat Commun. 2024 Nov 6;15(1):9582. doi: 10.1038/s41467-024-53917-8.
2
Characterization and mitigation of artifacts derived from NGS library preparation due to structure-specific sequences in the human genome.由于人类基因组中具有结构特异性的序列,对源自 NGS 文库制备的伪影进行特征描述和缓解。
BMC Genomics. 2024 Mar 1;25(1):227. doi: 10.1186/s12864-024-10157-w.
3
A T cell receptor targeting a recurrent driver mutation in FLT3 mediates elimination of primary human acute myeloid leukemia in vivo.

本文引用的文献

1
Use of FFPE-derived DNA in next generation sequencing: DNA extraction methods.FFPE 衍生 DNA 在下一代测序中的应用:DNA 提取方法。
PLoS One. 2019 Apr 11;14(4):e0211400. doi: 10.1371/journal.pone.0211400. eCollection 2019.
2
Fitness Costs and the Rapid Spread of -C580Y Substitutions Conferring Artemisinin Resistance.适应性成本与快速传播的 C580Y 取代突变赋予青蒿素耐药性。
Antimicrob Agents Chemother. 2018 Aug 27;62(9). doi: 10.1128/AAC.00605-18. Print 2018 Sep.
3
Molecular Minimal Residual Disease in Acute Myeloid Leukemia.
一种靶向 FLT3 复发驱动突变的 T 细胞受体在体内介导原发性人急性髓系白血病的消除。
Nat Cancer. 2023 Oct;4(10):1474-1490. doi: 10.1038/s43018-023-00642-8. Epub 2023 Oct 2.
4
Validation of genetic variants from NGS data using deep convolutional neural networks.使用深度卷积神经网络验证 NGS 数据中的遗传变异。
BMC Bioinformatics. 2023 Apr 20;24(1):158. doi: 10.1186/s12859-023-05255-7.
5
CRISPR-KRISPR: a method to identify on-target and random insertion of donor DNAs and their characterization in knock-in mice.CRISPR-KRISPR:一种鉴定供体 DNA 靶向和随机插入及其在基因敲入小鼠中特征的方法。
Genome Biol. 2022 Oct 25;23(1):228. doi: 10.1186/s13059-022-02779-8.
6
OPUSeq simplifies detection of low-frequency DNA variants and uncovers fragmentase-associated artifacts.OPUSeq简化了低频DNA变异的检测,并揭示了与片段酶相关的假象。
NAR Genom Bioinform. 2022 Jun 27;4(2):lqac048. doi: 10.1093/nargab/lqac048. eCollection 2022 Jun.
急性髓系白血病的分子微小残留病。
N Engl J Med. 2018 Mar 29;378(13):1189-1199. doi: 10.1056/NEJMoa1716863.
4
Midostaurin plus Chemotherapy for Acute Myeloid Leukemia with a FLT3 Mutation.米哚妥林联合化疗治疗伴有FLT3突变的急性髓系白血病
N Engl J Med. 2017 Aug 3;377(5):454-464. doi: 10.1056/NEJMoa1614359. Epub 2017 Jun 23.
5
A performance evaluation of Nextera XT and KAPA HyperPlus for rapid Illumina library preparation of long-range mitogenome amplicons.Nextera XT和KAPA HyperPlus用于快速制备长距离有丝分裂基因组扩增子的Illumina文库的性能评估。
Forensic Sci Int Genet. 2017 Jul;29:174-180. doi: 10.1016/j.fsigen.2017.04.003. Epub 2017 Apr 5.
6
BTK-Mediated Resistance to Ibrutinib in Chronic Lymphocytic Leukemia.布鲁顿酪氨酸激酶介导的慢性淋巴细胞白血病对依鲁替尼的耐药性
J Clin Oncol. 2017 May 1;35(13):1437-1443. doi: 10.1200/JCO.2016.70.2282. Epub 2017 Feb 13.
7
High-fidelity target sequencing of individual molecules identified using barcode sequences: de novo detection and absolute quantitation of mutations in plasma cell-free DNA from cancer patients.使用条形码序列鉴定单个分子的高保真靶向测序:癌症患者血浆游离DNA中突变的从头检测和绝对定量。
DNA Res. 2015 Aug;22(4):269-77. doi: 10.1093/dnares/dsv010. Epub 2015 Jun 29.
8
Skewer: a fast and accurate adapter trimmer for next-generation sequencing paired-end reads.Skewer:一种用于新一代测序双端读段的快速且准确的接头修剪工具。
BMC Bioinformatics. 2014 Jun 12;15:182. doi: 10.1186/1471-2105-15-182.
9
Library construction for next-generation sequencing: overviews and challenges.下一代测序文库构建:概述与挑战。
Biotechniques. 2014 Feb 1;56(2):61-4, 66, 68, passim. doi: 10.2144/000114133. eCollection 2014.
10
Telomere extension by telomerase and ALT generates variant repeats by mechanistically distinct processes.端粒酶和 ALT 通过机制上不同的过程延长端粒并产生变异重复序列。
Nucleic Acids Res. 2014 Feb;42(3):1733-46. doi: 10.1093/nar/gkt1117. Epub 2013 Nov 12.