• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DUDE-Seq:用于靶向扩增子测序的快速、灵活且稳健的去噪方法

DUDE-Seq: Fast, flexible, and robust denoising for targeted amplicon sequencing.

作者信息

Lee Byunghan, Moon Taesup, Yoon Sungroh, Weissman Tsachy

机构信息

Electrical and Computer Engineering, Seoul National University, Seoul, Korea.

College of Information and Communication Engineering, Sungkyunkwan University, Suwon, Korea.

出版信息

PLoS One. 2017 Jul 27;12(7):e0181463. doi: 10.1371/journal.pone.0181463. eCollection 2017.

DOI:10.1371/journal.pone.0181463
PMID:28749987
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5531809/
Abstract

We consider the correction of errors from nucleotide sequences produced by next-generation targeted amplicon sequencing. The next-generation sequencing (NGS) platforms can provide a great deal of sequencing data thanks to their high throughput, but the associated error rates often tend to be high. Denoising in high-throughput sequencing has thus become a crucial process for boosting the reliability of downstream analyses. Our methodology, named DUDE-Seq, is derived from a general setting of reconstructing finite-valued source data corrupted by a discrete memoryless channel and effectively corrects substitution and homopolymer indel errors, the two major types of sequencing errors in most high-throughput targeted amplicon sequencing platforms. Our experimental studies with real and simulated datasets suggest that the proposed DUDE-Seq not only outperforms existing alternatives in terms of error-correction capability and time efficiency, but also boosts the reliability of downstream analyses. Further, the flexibility of DUDE-Seq enables its robust application to different sequencing platforms and analysis pipelines by simple updates of the noise model. DUDE-Seq is available at http://data.snu.ac.kr/pub/dude-seq.

摘要

我们考虑对下一代靶向扩增子测序产生的核苷酸序列中的错误进行校正。下一代测序(NGS)平台因其高通量能够提供大量测序数据,但相关错误率往往较高。因此,高通量测序中的去噪已成为提高下游分析可靠性的关键过程。我们的方法名为DUDE-Seq,源自对由离散无记忆信道损坏的有限值源数据进行重构的一般设置,并能有效校正替换错误和同聚物插入缺失错误,这是大多数高通量靶向扩增子测序平台中的两种主要测序错误类型。我们对真实和模拟数据集的实验研究表明,所提出的DUDE-Seq不仅在纠错能力和时间效率方面优于现有方法,还提高了下游分析的可靠性。此外,DUDE-Seq的灵活性使其能够通过简单更新噪声模型而稳健地应用于不同的测序平台和分析流程。DUDE-Seq可在http://data.snu.ac.kr/pub/dude-seq获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f142/5531809/25d4f267e760/pone.0181463.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f142/5531809/811899fa833a/pone.0181463.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f142/5531809/c30c10282951/pone.0181463.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f142/5531809/f1e2834fc5ee/pone.0181463.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f142/5531809/b4a058ba2284/pone.0181463.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f142/5531809/dbc4c9e59f05/pone.0181463.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f142/5531809/08c80cf47257/pone.0181463.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f142/5531809/ea4b48378ef2/pone.0181463.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f142/5531809/8e871ed464bb/pone.0181463.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f142/5531809/25d4f267e760/pone.0181463.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f142/5531809/811899fa833a/pone.0181463.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f142/5531809/c30c10282951/pone.0181463.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f142/5531809/f1e2834fc5ee/pone.0181463.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f142/5531809/b4a058ba2284/pone.0181463.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f142/5531809/dbc4c9e59f05/pone.0181463.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f142/5531809/08c80cf47257/pone.0181463.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f142/5531809/ea4b48378ef2/pone.0181463.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f142/5531809/8e871ed464bb/pone.0181463.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f142/5531809/25d4f267e760/pone.0181463.g009.jpg

相似文献

1
DUDE-Seq: Fast, flexible, and robust denoising for targeted amplicon sequencing.DUDE-Seq:用于靶向扩增子测序的快速、灵活且稳健的去噪方法
PLoS One. 2017 Jul 27;12(7):e0181463. doi: 10.1371/journal.pone.0181463. eCollection 2017.
2
Long-read amplicon denoising.长读扩增子去噪。
Nucleic Acids Res. 2019 Oct 10;47(18):e104. doi: 10.1093/nar/gkz657.
3
A comparative study of k-spectrum-based error correction methods for next-generation sequencing data analysis.基于k谱的下一代测序数据分析纠错方法的比较研究。
Hum Genomics. 2016 Jul 25;10 Suppl 2(Suppl 2):20. doi: 10.1186/s40246-016-0068-0.
4
HECTOR: a parallel multistage homopolymer spectrum based error corrector for 454 sequencing data.HECTOR:一种基于平行多阶段同聚物谱的 454 测序数据错误校正器。
BMC Bioinformatics. 2014 May 6;15:131. doi: 10.1186/1471-2105-15-131.
5
Noise cancellation using total variation for copy number variation detection.利用全变差降噪进行拷贝数变异检测。
BMC Bioinformatics. 2018 Oct 22;19(Suppl 11):361. doi: 10.1186/s12859-018-2332-x.
6
EC: an efficient error correction algorithm for short reads.EC:一种用于短读段的高效纠错算法。
BMC Bioinformatics. 2015;16 Suppl 17(Suppl 17):S2. doi: 10.1186/1471-2105-16-S17-S2. Epub 2015 Dec 7.
7
Performance evaluation of lossy quality compression algorithms for RNA-seq data.RNA-seq 数据有损质量压缩算法的性能评估。
BMC Bioinformatics. 2020 Jul 20;21(1):321. doi: 10.1186/s12859-020-03658-4.
8
IPED: a highly efficient denoising tool for Illumina MiSeq Paired-end 16S rRNA gene amplicon sequencing data.IPED:一种用于Illumina MiSeq双端16S rRNA基因扩增子测序数据的高效去噪工具。
BMC Bioinformatics. 2016 Apr 29;17(1):192. doi: 10.1186/s12859-016-1061-2.
9
BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing.BAMClipper:从比对中移除引物,以最大限度减少扩增子下一代测序中的假阴性突变。
Sci Rep. 2017 May 8;7(1):1567. doi: 10.1038/s41598-017-01703-6.
10
In search of perfect reads.寻找完美的读数。
BMC Bioinformatics. 2015;16 Suppl 17(Suppl 17):S7. doi: 10.1186/1471-2105-16-S17-S7. Epub 2015 Dec 7.

引用本文的文献

1
Native Rhizobial Inoculation Improves Tomato Yield and Nutrient Uptake While Mitigating Heavy Metal Accumulation in a Conventional Farming System.在传统耕作系统中,接种本地根瘤菌可提高番茄产量和养分吸收,同时减轻重金属积累。
Microorganisms. 2025 Aug 15;13(8):1904. doi: 10.3390/microorganisms13081904.
2
Comparative metagenomic analysis of 16S RNA amplicon sequencing of bacterial population of an industrial site contaminated with halogenated anilines.对受卤代苯胺污染的工业场地细菌种群进行16S RNA扩增子测序的比较宏基因组分析。
Microbiol Resour Announc. 2025 Apr 10;14(4):e0085624. doi: 10.1128/mra.00856-24. Epub 2025 Feb 25.
3

本文引用的文献

1
Coming of age: ten years of next-generation sequencing technologies.成年:下一代测序技术的十年
Nat Rev Genet. 2016 May 17;17(6):333-51. doi: 10.1038/nrg.2016.49.
2
Effect of lossy compression of quality scores on variant calling.质量分数的有损压缩对变异检测的影响。
Brief Bioinform. 2017 Mar 1;18(2):183-194. doi: 10.1093/bib/bbw011.
3
Coverage analysis in a targeted amplicon-based next-generation sequencing panel for myeloid neoplasms.基于靶向扩增子的新一代测序 panel 在髓系肿瘤中的覆盖度分析。
From colon wall to tumor niche: Unraveling the microbiome's role in colorectal cancer progression.
从结肠壁到肿瘤微环境:解析微生物组在结直肠癌进展中的作用。
PLoS One. 2024 Oct 22;19(10):e0311233. doi: 10.1371/journal.pone.0311233. eCollection 2024.
4
Comprehensive analysis of the microbiome in honey highlights honey as a potential source for the isolation of beneficial bacterial strains.对蜂蜜微生物组的综合分析表明,蜂蜜可能是有益细菌菌株分离的潜在来源。
PeerJ. 2024 Mar 27;12:e17157. doi: 10.7717/peerj.17157. eCollection 2024.
5
The Possible Preventative Role of Lactate- and Butyrate-Producing Bacteria in Colorectal Carcinogenesis.产乳酸盐和丁酸盐细菌在结直肠癌发生中的可能预防作用。
Gut Liver. 2024 Jul 15;18(4):654-666. doi: 10.5009/gnl230385. Epub 2023 Nov 30.
6
Characteristics of rhizosphere and endogenous bacterial community of Ulleung-sanmaneul, an endemic plant in Korea: application for alleviating salt stress.韩国特有植物尤郎山麦蓝菜根际和内生细菌群落特征:缓解盐胁迫的应用。
Sci Rep. 2022 Dec 7;12(1):21124. doi: 10.1038/s41598-022-25731-z.
7
The Gut Microbiota at Different Developmental Stages of Reveals Potential Probiotic Bacteria for Improving Honeybee Health.不同发育阶段的肠道微生物群揭示了改善蜜蜂健康的潜在益生菌。
Microorganisms. 2022 Sep 29;10(10):1938. doi: 10.3390/microorganisms10101938.
8
Changes in Gut Microbiome upon Orchiectomy and Testosterone Administration in AOM/DSS-Induced Colon Cancer Mouse Model.去势和雄激素给药对 AOM/DSS 诱导的结肠癌小鼠模型肠道微生物组的影响。
Cancer Res Treat. 2023 Jan;55(1):196-218. doi: 10.4143/crt.2022.080. Epub 2022 Jul 1.
9
Influence of formulation on skin microbiota: A randomized, split face comparative clinical study.配方对皮肤微生物群的影响:一项随机、半脸对照临床研究。
J Ginseng Res. 2022 Mar;46(2):296-303. doi: 10.1016/j.jgr.2021.12.002. Epub 2021 Dec 14.
10
Effect of luxS encoding a synthase of quorum-sensing signal molecule AI-2 of Vibrio vulnificus on mouse gut microbiome.创伤弧菌群体感应信号分子 AI-2 合成酶 LuxS 对小鼠肠道微生物组的影响。
Appl Microbiol Biotechnol. 2022 May;106(9-10):3721-3734. doi: 10.1007/s00253-022-11935-w. Epub 2022 Apr 30.
J Clin Pathol. 2016 Sep;69(9):801-4. doi: 10.1136/jclinpath-2015-203580. Epub 2016 Feb 19.
4
Clinical application of next-generation sequencing for Mendelian diseases.孟德尔疾病的二代测序临床应用
Hum Genomics. 2015 Jun 16;9(1):10. doi: 10.1186/s40246-015-0031-5.
5
Denoising DNA deep sequencing data-high-throughput sequencing errors and their correction.去噪DNA深度测序数据——高通量测序错误及其校正
Brief Bioinform. 2016 Jan;17(1):154-79. doi: 10.1093/bib/bbv029. Epub 2015 May 29.
6
Improved data analysis for the MinION nanopore sequencer.针对MinION纳米孔测序仪的数据分析改进。
Nat Methods. 2015 Apr;12(4):351-6. doi: 10.1038/nmeth.3290. Epub 2015 Feb 16.
7
Pollux: platform independent error correction of single and mixed genomes.Pollux:单基因组和混合基因组的平台无关错误校正
BMC Bioinformatics. 2015 Jan 16;16(1):10. doi: 10.1186/s12859-014-0435-6.
8
Insight into biases and sequencing errors for amplicon sequencing with the Illumina MiSeq platform.深入了解Illumina MiSeq平台进行扩增子测序时的偏差和测序错误。
Nucleic Acids Res. 2015 Mar 31;43(6):e37. doi: 10.1093/nar/gku1341. Epub 2015 Jan 13.
9
CASPER: context-aware scheme for paired-end reads from high-throughput amplicon sequencing.CASPER:一种用于高通量扩增子测序中双端reads 的上下文感知方案。
BMC Bioinformatics. 2014;15 Suppl 9(Suppl 9):S10. doi: 10.1186/1471-2105-15-S9-S10. Epub 2014 Sep 10.
10
Fiona: a parallel and automatic strategy for read error correction.菲奥娜:一种用于读取错误校正的并行自动策略。
Bioinformatics. 2014 Sep 1;30(17):i356-63. doi: 10.1093/bioinformatics/btu440.