• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PCR 诱导的转换是超深度焦磷酸测序数据清洗后主要的错误来源。

PCR-induced transitions are the major source of error in cleaned ultra-deep pyrosequencing data.

机构信息

Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet, Stockholm, Sweden.

出版信息

PLoS One. 2013 Jul 23;8(7):e70388. doi: 10.1371/journal.pone.0070388. Print 2013.

DOI:10.1371/journal.pone.0070388
PMID:23894647
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3720931/
Abstract

BACKGROUND

Ultra-deep pyrosequencing (UDPS) is used to identify rare sequence variants. The sequence depth is influenced by several factors including the error frequency of PCR and UDPS. This study investigated the characteristics and source of errors in raw and cleaned UDPS data.

RESULTS

UDPS of a 167-nucleotide fragment of the HIV-1 SG3Δenv plasmid was performed on the Roche/454 platform. The plasmid was diluted to one copy, PCR amplified and subjected to bidirectional UDPS on three occasions. The dataset consisted of 47,693 UDPS reads. Raw UDPS data had an average error frequency of 0.30% per nucleotide site. Most errors were insertions and deletions in homopolymeric regions. We used a cleaning strategy that removed almost all indel errors, but had little effect on substitution errors, which reduced the error frequency to 0.056% per nucleotide. In cleaned data the error frequency was similar in homopolymeric and non-homopolymeric regions, but varied considerably across sites. These site-specific error frequencies were moderately, but still significantly, correlated between runs (r=0.15-0.65) and between forward and reverse sequencing directions within runs (r=0.33-0.65). Furthermore, transition errors were 48-times more common than transversion errors (0.052% vs. 0.001%; p<0.0001). Collectively the results indicate that a considerable proportion of the sequencing errors that remained after data cleaning were generated during the PCR that preceded UDPS.

CONCLUSIONS

A majority of the sequencing errors that remained after data cleaning were introduced by PCR prior to sequencing, which means that they will be independent of platform used for next-generation sequencing. The transition vs. transversion error bias in cleaned UDPS data will influence the detection limits of rare mutations and sequence variants.

摘要

背景

超深度焦磷酸测序(UDPS)用于鉴定罕见的序列变异。序列深度受多种因素影响,包括 PCR 和 UDPS 的错误频率。本研究调查了原始和清理后的 UDPS 数据中的错误特征和来源。

结果

在 Roche/454 平台上对 HIV-1 SG3Δenv 质粒的 167 个核苷酸片段进行 UDPS。将质粒稀释至一个拷贝,进行 PCR 扩增,并在三个时间点进行双向 UDPS。数据集由 47693 个 UDPS 读取组成。原始 UDPS 数据的平均错误频率为每个核苷酸位置的 0.30%。大多数错误是在同聚核苷酸区域的插入和缺失。我们使用了一种清理策略,几乎去除了所有的插入缺失错误,但对替换错误几乎没有影响,将错误频率降低到每个核苷酸的 0.056%。在清理数据中,同聚核苷酸和非同聚核苷酸区域的错误频率相似,但在各个位置变化很大。这些位置特异性错误频率在运行之间(r=0.15-0.65)和运行内的正向和反向测序方向之间(r=0.33-0.65)中度但仍然显著相关。此外,转换错误比颠换错误常见 48 倍(0.052%对 0.001%;p<0.0001)。总的来说,这些结果表明,在 UDPS 之前的 PCR 过程中产生了大量在数据清理后仍然存在的测序错误。

结论

在数据清理后仍然存在的大多数测序错误是在测序之前的 PCR 中引入的,这意味着它们将独立于用于下一代测序的平台。在清理后的 UDPS 数据中,转换错误与颠换错误的错误偏向将影响稀有突变和序列变异的检测极限。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63ac/3720931/f7444e34cc95/pone.0070388.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63ac/3720931/ffd6892d72aa/pone.0070388.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63ac/3720931/6589aab737f5/pone.0070388.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63ac/3720931/1fe965c977c3/pone.0070388.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63ac/3720931/f7444e34cc95/pone.0070388.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63ac/3720931/ffd6892d72aa/pone.0070388.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63ac/3720931/6589aab737f5/pone.0070388.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63ac/3720931/1fe965c977c3/pone.0070388.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63ac/3720931/f7444e34cc95/pone.0070388.g004.jpg

相似文献

1
PCR-induced transitions are the major source of error in cleaned ultra-deep pyrosequencing data.PCR 诱导的转换是超深度焦磷酸测序数据清洗后主要的错误来源。
PLoS One. 2013 Jul 23;8(7):e70388. doi: 10.1371/journal.pone.0070388. Print 2013.
2
Performance of ultra-deep pyrosequencing in analysis of HIV-1 pol gene variation.超深度焦磷酸测序技术在 HIV-1 pol 基因变异分析中的性能评估。
PLoS One. 2011;6(7):e22741. doi: 10.1371/journal.pone.0022741. Epub 2011 Jul 25.
3
Analysis of 454 sequencing error rate, error sources, and artifact recombination for detection of Low-frequency drug resistance mutations in HIV-1 DNA.分析 454 测序错误率、错误来源和人工制品重组,以检测 HIV-1 DNA 中的低频耐药突变。
Retrovirology. 2013 Feb 13;10:18. doi: 10.1186/1742-4690-10-18.
4
Challenges with using primer IDs to improve accuracy of next generation sequencing.使用引物ID提高下一代测序准确性所面临的挑战。
PLoS One. 2015 Mar 5;10(3):e0119123. doi: 10.1371/journal.pone.0119123. eCollection 2015.
5
Analysis of error profiles in deep next-generation sequencing data.深度下一代测序数据中的错误分析。
Genome Biol. 2019 Mar 14;20(1):50. doi: 10.1186/s13059-019-1659-6.
6
Dynamics of HIV-1 quasispecies during antiviral treatment dissected using ultra-deep pyrosequencing.采用超深度焦磷酸测序技术解析抗病毒治疗过程中 HIV-1 准种的动态变化。
PLoS One. 2010 Jul 7;5(7):e11345. doi: 10.1371/journal.pone.0011345.
7
Ultra-deep pyrosequencing (UDPS) data treatment to study amplicon HCV minor variants.超深度焦磷酸测序(UDPS)数据分析处理用于研究 HCV 扩增子的次要变异体。
PLoS One. 2013 Dec 31;8(12):e83361. doi: 10.1371/journal.pone.0083361. eCollection 2013.
8
'Sentinel' mutations in standard population sequencing can predict the presence of HIV-1 reverse transcriptase major mutations detectable only by ultra-deep pyrosequencing.标准人群测序中的“哨兵”突变可预测仅通过超深度焦磷酸测序才能检测到的 HIV-1 逆转录酶主要突变的存在。
J Antimicrob Chemother. 2011 Nov;66(11):2615-23. doi: 10.1093/jac/dkr354. Epub 2011 Sep 2.
9
Primer ID Validates Template Sampling Depth and Greatly Reduces the Error Rate of Next-Generation Sequencing of HIV-1 Genomic RNA Populations.引物ID验证模板采样深度并大幅降低HIV-1基因组RNA群体下一代测序的错误率。
J Virol. 2015 Aug;89(16):8540-55. doi: 10.1128/JVI.00522-15. Epub 2015 Jun 3.
10
Barcode-free next-generation sequencing error validation for ultra-rare variant detection.无条码的下一代测序错误验证,用于超低频变异检测。
Nat Commun. 2019 Feb 28;10(1):977. doi: 10.1038/s41467-019-08941-4.

引用本文的文献

1
Chasing Sequencing Perfection: Marching Toward Higher Accuracy and Lower Costs.追求测序完美:迈向更高精度和更低成本。
Genomics Proteomics Bioinformatics. 2024 Jul 3;22(2). doi: 10.1093/gpbjnl/qzae024.
2
NGS-Based Tumor-Informed Analysis of Circulating Tumor DNA.基于二代测序的循环肿瘤DNA肿瘤知情分析
Methods Mol Biol. 2023;2684:179-197. doi: 10.1007/978-1-0716-3291-8_11.
3
Development of a versatile high-throughput mutagenesis assay with multiplexed short-read NGS using DNA-barcoded shuttle vector library amplified in .

本文引用的文献

1
Analysis of 454 sequencing error rate, error sources, and artifact recombination for detection of Low-frequency drug resistance mutations in HIV-1 DNA.分析 454 测序错误率、错误来源和人工制品重组,以检测 HIV-1 DNA 中的低频耐药突变。
Retrovirology. 2013 Feb 13;10:18. doi: 10.1186/1742-4690-10-18.
2
Accurate sampling and deep sequencing of the HIV-1 protease gene using a Primer ID.使用引物 ID 对 HIV-1 蛋白酶基因进行准确的取样和深度测序。
Proc Natl Acad Sci U S A. 2011 Dec 13;108(50):20166-71. doi: 10.1073/pnas.1110064108. Epub 2011 Nov 30.
3
A comparison of parallel pyrosequencing and sanger clone-based sequencing and its impact on the characterization of the genetic diversity of HIV-1.
利用. 中扩增的带 DNA 条码的穿梭载体文库进行多重短读长 NGS 的多功能高通量诱变分析的建立
Elife. 2022 Oct 10;11:e83780. doi: 10.7554/eLife.83780.
4
On taming the effect of transcript level intra-condition count variation during differential expression analysis: A story of dogs, foxes and wolves.在差异表达分析中驯服转录本水平内条件计数变异的影响:以狗、狐狸和狼为例。
PLoS One. 2022 Sep 22;17(9):e0274591. doi: 10.1371/journal.pone.0274591. eCollection 2022.
5
Lessons to Learn from the Gut Microbiota: A Focus on Amyotrophic Lateral Sclerosis.从肠道微生物组中吸取教训:聚焦肌萎缩侧索硬化症。
Genes (Basel). 2022 May 12;13(5):865. doi: 10.3390/genes13050865.
6
Contamination Issue in Viral Metagenomics: Problems, Solutions, and Clinical Perspectives.病毒宏基因组学中的污染问题:问题、解决方案及临床视角
Front Microbiol. 2021 Oct 20;12:745076. doi: 10.3389/fmicb.2021.745076. eCollection 2021.
7
Bias in RNA-seq Library Preparation: Current Challenges and Solutions.RNA测序文库制备中的偏差:当前挑战与解决方案
Biomed Res Int. 2021 Apr 19;2021:6647597. doi: 10.1155/2021/6647597. eCollection 2021.
8
MIPP-Seq: ultra-sensitive rapid detection and validation of low-frequency mosaic mutations.MIPP-Seq:超灵敏快速检测和验证低频镶嵌突变。
BMC Med Genomics. 2021 Feb 12;14(1):47. doi: 10.1186/s12920-021-00893-3.
9
Dual Deep Sequencing Improves the Accuracy of Low-Frequency Somatic Mutation Detection in Cancer Gene Panel Testing.双重深度测序提高癌症基因panel 检测中低频种系突变检测的准确性。
Int J Mol Sci. 2020 May 16;21(10):3530. doi: 10.3390/ijms21103530.
10
SinoDuplex: An Improved Duplex Sequencing Approach to Detect Low-frequency Variants in Plasma cfDNA Samples.SinoDuplex:一种改进的双端测序方法,用于检测血浆 cfDNA 样本中的低频变异。
Genomics Proteomics Bioinformatics. 2020 Feb;18(1):81-90. doi: 10.1016/j.gpb.2020.02.003. Epub 2020 May 16.
平行焦磷酸测序与基于 Sanger 克隆测序的比较及其对 HIV-1 遗传多样性特征分析的影响。
PLoS One. 2011;6(10):e26745. doi: 10.1371/journal.pone.0026745. Epub 2011 Oct 21.
4
Minor variant detection in amplicons using 454 massive parallel pyrosequencing: experiences and considerations for successful applications.使用 454 大规模平行焦磷酸测序检测扩增子中的微小变异:成功应用的经验和考虑因素。
Biotechniques. 2011 Sep;51(3):167-77. doi: 10.2144/000113733.
5
Systematic exploration of error sources in pyrosequencing flowgram data.系统探索焦磷酸测序图谱数据中的误差来源。
Bioinformatics. 2011 Jul 1;27(13):i304-9. doi: 10.1093/bioinformatics/btr251.
6
Low-frequency HIV-1 drug resistance mutations and risk of NNRTI-based antiretroviral treatment failure: a systematic review and pooled analysis.低频 HIV-1 耐药突变与 NNRTI 为基础的抗逆转录病毒治疗失败风险:系统评价和汇总分析。
JAMA. 2011 Apr 6;305(13):1327-35. doi: 10.1001/jama.2011.375.
7
Removing noise from pyrosequenced amplicons.从焦磷酸测序扩增子中去除噪声。
BMC Bioinformatics. 2011 Jan 28;12:38. doi: 10.1186/1471-2105-12-38.
8
Effects of polymerase, template dilution and cycle number on PCR based 16 S rRNA diversity analysis using the deep sequencing method.聚合酶、模板稀释和循环数对基于深度测序的 16S rRNA 多样性分析的影响。
BMC Microbiol. 2010 Oct 12;10:255. doi: 10.1186/1471-2180-10-255.
9
Error correction of next-generation sequencing data and reliable estimation of HIV quasispecies.下一代测序数据纠错和 HIV 准种可靠估计。
Nucleic Acids Res. 2010 Nov;38(21):7400-9. doi: 10.1093/nar/gkq655. Epub 2010 Jul 29.
10
Dynamics of HIV-1 quasispecies during antiviral treatment dissected using ultra-deep pyrosequencing.采用超深度焦磷酸测序技术解析抗病毒治疗过程中 HIV-1 准种的动态变化。
PLoS One. 2010 Jul 7;5(7):e11345. doi: 10.1371/journal.pone.0011345.