Systematic exploration of error sources in pyrosequencing flowgram data.

Affiliation

Institute of Marine Research, P.O. Box 1870, N-5817 Bergen, Norway.

Publication information

Bioinformatics. 2011 Jul 1;27(13):i304-9. doi: 10.1093/bioinformatics/btr251.

DOI:10.1093/bioinformatics/btr251
PMID:21685085
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3117331/
Abstract

MOTIVATION

454 pyrosequencing, by Roche Diagnostics, has emerged as an alternative to Sanger sequencing when it comes to read lengths, performance and cost, but shows higher per-base error rates. Although there are several tools available for noise removal, targeting different application fields, data interpretation would benefit from a better understanding of the different error types.

RESULTS

By exploring 454 raw data, we quantify to what extent different factors account for sequencing errors. In addition to the well-known homopolymer length inaccuracies, we have identified errors likely to originate from other stages of the sequencing process. We use our findings to extend the flowsim pipeline with functionalities to simulate these errors, and thus enable a more realistic simulation of 454 pyrosequencing data with flowsim.
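To make the homopolymer length inaccuracies mentioned above concrete, the following is a minimal Python sketch of how a flowgram encodes a read and how noise in a flow value becomes an over-call or under-call. It is not the flowsim implementation (flowsim is a separate tool and uses its own error models). The cyclic flow order `TACG`, the Gaussian noise model, and the parameters `base_sd` and `sd_per_base` are assumptions chosen purely for illustration.

```python
import random

FLOW_ORDER = "TACG"  # assumed cyclic nucleotide flow order (illustrative)


def ideal_flowgram(seq, flow_order=FLOW_ORDER):
    """Return noise-free flow values: one homopolymer length per flow."""
    unknown = set(seq) - set(flow_order)
    if unknown:
        raise ValueError(f"bases not in flow order: {sorted(unknown)}")
    flows = []
    i = 0  # position in the sequence
    k = 0  # index of the current flow
    while i < len(seq):
        base = flow_order[k % len(flow_order)]
        run = 0
        # Each flow incorporates the whole run of the flowed base at once.
        while i < len(seq) and seq[i] == base:
            run += 1
            i += 1
        flows.append(run)
        k += 1
    return flows


def add_noise(flows, rng, base_sd=0.05, sd_per_base=0.05):
    """Perturb each flow value with Gaussian noise.

    The spread grows with homopolymer length. This is a hypothetical
    noise model for illustration, not an empirically fitted one.
    """
    return [max(0.0, rng.gauss(n, base_sd + sd_per_base * n)) for n in flows]


def call_bases(flow_values, flow_order=FLOW_ORDER):
    """Round each non-negative flow value half-up and emit that many bases."""
    called = []
    for k, value in enumerate(flow_values):
        length = int(value + 0.5)
        called.append(flow_order[k % len(flow_order)] * length)
    return "".join(called)


if __name__ == "__main__":
    rng = random.Random(0)
    read = "GATTTTTTTTACA"  # contains a homopolymer run of eight T's
    noisy = add_noise(ideal_flowgram(read), rng)
    print(call_bases(noisy))  # long runs are the most likely to be miscalled
```

Because the noise spread in this sketch scales with the run length, a flow value for a long homopolymer is more likely to cross a rounding boundary than a value for a single base, which is the intuition behind homopolymer over-calls and under-calls. Errors arising at other stages of the sequencing process, which the abstract also refers to, are not modeled here.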

AVAILABILITY

The flowsim pipeline is freely available under the General Public License from http://biohaskell.org/Applications/FlowSim.

CONTACT

susanne.balzer@imr.no.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80b4/3117331/7da64db4ab7a/btr251f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80b4/3117331/a3575d4630d6/btr251f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80b4/3117331/85c1adc8bff8/btr251f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80b4/3117331/c3571d0c345a/btr251f3.jpg

Similar articles

1
Systematic exploration of error sources in pyrosequencing flowgram data.
Bioinformatics. 2011 Jul 1;27(13):i304-9. doi: 10.1093/bioinformatics/btr251.
2
Characteristics of 454 pyrosequencing data--enabling realistic simulation with flowsim.
Bioinformatics. 2010 Sep 15;26(18):i420-5. doi: 10.1093/bioinformatics/btq365.
3
Filtering duplicate reads from 454 pyrosequencing data.
Bioinformatics. 2013 Apr 1;29(7):830-6. doi: 10.1093/bioinformatics/btt047. Epub 2013 Feb 1.
4
Quality score based identification and correction of pyrosequencing errors.
PLoS One. 2013 Sep 5;8(9):e73015. doi: 10.1371/journal.pone.0073015. eCollection 2013.
5
Simulating a population genomics data set using FlowSim.
BMC Res Notes. 2014 Jan 31;7:68. doi: 10.1186/1756-0500-7-68.
6
Designing robust watermark barcodes for multiplex long-read sequencing.
Bioinformatics. 2017 Mar 15;33(6):807-813. doi: 10.1093/bioinformatics/btw322.
7
SimLoRD: Simulation of Long Read Data.
Bioinformatics. 2016 Sep 1;32(17):2704-6. doi: 10.1093/bioinformatics/btw286. Epub 2016 May 10.
8
Fundamentals of pyrosequencing.
Arch Pathol Lab Med. 2013 Sep;137(9):1296-303. doi: 10.5858/arpa.2012-0463-RA.
9
Bias from removing read duplication in ultra-deep sequencing experiments.
Bioinformatics. 2014 Apr 15;30(8):1073-1080. doi: 10.1093/bioinformatics/btt771. Epub 2014 Jan 2.
10
Aggressive assembly of pyrosequencing reads with mates.
Bioinformatics. 2008 Dec 15;24(24):2818-24. doi: 10.1093/bioinformatics/btn548. Epub 2008 Oct 24.

Cited by

1
Rare deleterious germline variants and risk of lung cancer.
NPJ Precis Oncol. 2021 Feb 16;5(1):12. doi: 10.1038/s41698-021-00146-7.
2
Fungal metabarcoding data integration framework for the MycoDiversity DataBase (MDDB).
J Integr Bioinform. 2020 May 28;17(1):20190046. doi: 10.1515/jib-2019-0046.
3
Insights from the revised complete genome sequences of Acinetobacter baumannii strains AB307-0294 and ACICU belonging to global clones 1 and 2.
Microb Genom. 2019 Oct;5(10). doi: 10.1099/mgen.0.000298.
4
Formal description of sequence-based voucherless Fungi: promises and pitfalls, and how to resolve them.
IMA Fungus. 2018 Jun;9(1):143-166. doi: 10.5598/imafungus.2018.09.01.09. Epub 2018 May 22.
5
Enhanced detection of microsatellite instability using pre-PCR elimination of wild-type DNA homo-polymers in tissue and liquid biopsies.
Nucleic Acids Res. 2018 Jul 6;46(12):e74. doi: 10.1093/nar/gky251.
6
Analytical parameters and validation of homopolymer detection in a pyrosequencing-based next generation sequencing system.
BMC Genomics. 2018 Feb 21;19(1):158. doi: 10.1186/s12864-018-4544-x.
7
Diversity and Structure of Fungal Communities in Neotropical Rainforest Soils: The Effect of Host Recurrence.
Microb Ecol. 2017 Feb;73(2):310-320. doi: 10.1007/s00248-016-0839-0. Epub 2016 Sep 19.
8
MonoSeq Variant Caller Reveals Novel Mononucleotide Run Indel Mutations in Tumors with Defective DNA Mismatch Repair.
Hum Mutat. 2016 Oct;37(10):1004-12. doi: 10.1002/humu.23036. Epub 2016 Aug 8.
9
Biomarker in Colorectal Cancer.
Cancer J. 2016 May-Jun;22(3):156-64. doi: 10.1097/PPO.0000000000000190.
10
A comparison of tools for the simulation of genomic next-generation sequencing data.
Nat Rev Genet. 2016 Aug;17(8):459-69. doi: 10.1038/nrg.2016.57. Epub 2016 Jun 20.

References

1
Removing noise from pyrosequenced amplicons.
BMC Bioinformatics. 2011 Jan 28;12:38. doi: 10.1186/1471-2105-12-38.
2
Characteristics of 454 pyrosequencing data--enabling realistic simulation with flowsim.
Bioinformatics. 2010 Sep 15;26(18):i420-5. doi: 10.1093/bioinformatics/btq365.
3
Artificial and natural duplicates in pyrosequencing reads of metagenomic data.
BMC Bioinformatics. 2010 Apr 13;11:187. doi: 10.1186/1471-2105-11-187.
4
The European sea bass Dicentrarchus labrax genome puzzle: comparative BAC-mapping and low coverage shotgun sequencing.
BMC Genomics. 2010 Jan 27;11:68. doi: 10.1186/1471-2164-11-68.
5
The effect of sequencing errors on metagenomic gene prediction.
BMC Genomics. 2009 Nov 12;10:520. doi: 10.1186/1471-2164-10-520.
6
Wrinkles in the rare biosphere: pyrosequencing errors can lead to artificial inflation of diversity estimates.
Environ Microbiol. 2010 Jan;12(1):118-23. doi: 10.1111/j.1462-2920.2009.02051.x. Epub 2009 Aug 27.
7
Accurate determination of microbial diversity from 454 pyrosequencing data.
Nat Methods. 2009 Sep;6(9):639-41. doi: 10.1038/nmeth.1361. Epub 2009 Aug 9.
8
Systematic artifacts in metagenomes from complex microbial communities.
ISME J. 2009 Nov;3(11):1314-7. doi: 10.1038/ismej.2009.72. Epub 2009 Jul 9.
9
Evaluation of next generation sequencing platforms for population targeted sequencing studies.
Genome Biol. 2009;10(3):R32. doi: 10.1186/gb-2009-10-3-r32. Epub 2009 Mar 27.
10
Pyrobayes: an improved base caller for SNP discovery in pyrosequences.
Nat Methods. 2008 Feb;5(2):179-81. doi: 10.1038/nmeth.1172. Epub 2008 Jan 13.