Basecalling with LifeTrace.

Authors

Walther D, Bartha G, Morris M

Affiliation

Incyte Genomics, Inc., Palo Alto, California 94304, USA.

Publication

Genome Res. 2001 May;11(5):875-88. doi: 10.1101/gr.177901.

DOI: 10.1101/gr.177901
PMID: 11337481
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC311100/
Abstract

A pivotal step in electrophoresis sequencing is the conversion of the raw, continuous chromatogram data into the actual sequence of discrete nucleotides, a process referred to as basecalling. We describe a novel algorithm for basecalling implemented in the program LifeTrace. Like Phred, currently the most widely used basecalling software program, LifeTrace takes processed trace data as input. It was designed to be tolerant to variable peak spacing by means of an improved peak-detection algorithm that emphasizes local chromatogram information over global properties. LifeTrace is shown to generate high-quality basecalls and reliable quality scores. It proved particularly effective when applied to MegaBACE capillary sequencing machines. In a benchmark test of 8372 dye-primer MegaBACE chromatograms, LifeTrace generated 17% fewer substitution errors, 16% fewer insertion/deletion errors, and 2.4% more aligned bases to the finished sequence than did Phred. For two sets totaling 6624 dye-terminator chromatograms, the performance improvement was 15% fewer substitution errors, 10% fewer insertion/deletion errors, and 2.1% more aligned bases. The processing time required by LifeTrace is comparable to that of Phred. The predicted quality scores were in line with observed quality scores, permitting direct use for quality clipping and in silico single nucleotide polymorphism (SNP) detection. Furthermore, we introduce a new type of quality score associated with every basecall: the gap-quality. It estimates the probability of a deletion error between the current and the following basecall. This additional quality score improves detection of single basepair deletions when used for locating potential basecalling errors during the alignment. We also describe a new protocol for benchmarking that we believe better discerns basecaller performance differences than methods previously published.
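
The abstract notes that the predicted quality scores can be used directly for quality clipping. As a rough illustration of what that involves, the sketch below converts Phred-scale quality scores to error probabilities using the standard relation Q = -10 log10(P) and then picks the contiguous region of a read that maximizes the summed excess of quality over a cutoff. This is a generic maximum-sum trimming heuristic, not the procedure used by LifeTrace or Phred; the function names, the cutoff of 20, and the sample scores are all assumptions made for illustration.

```python
import math
from typing import List, Tuple


def phred_to_error_prob(q: float) -> float:
    """Convert a Phred-scale quality score to an error probability."""
    return 10 ** (-q / 10)


def error_prob_to_phred(p: float) -> float:
    """Convert an error probability (0 < p <= 1) to a Phred-scale score."""
    return -10 * math.log10(p)


def clip_by_quality(quals: List[int], threshold: int = 20) -> Tuple[int, int]:
    """Return the half-open interval (start, end) to keep after clipping.

    Hypothetical heuristic: choose the contiguous run that maximizes the
    sum of (quality - threshold). Returns (0, 0) if no base exceeds the
    threshold.
    """
    best_sum, best_span = 0, (0, 0)
    cur_sum, cur_start = 0, 0
    for i, q in enumerate(quals):
        if cur_sum <= 0:
            # A non-positive running sum cannot help; restart the run here.
            cur_sum, cur_start = 0, i
        cur_sum += q - threshold
        if cur_sum > best_sum:
            best_sum, best_span = cur_sum, (cur_start, i + 1)
    return best_span


if __name__ == "__main__":
    # Made-up quality scores: poor ends, good middle with one weak base.
    quals = [5, 8, 30, 35, 40, 10, 38, 6, 4]
    start, end = clip_by_quality(quals, threshold=20)
    print("keep bases", start, "to", end, "->", quals[start:end])
    print("error probability at Q20:", phred_to_error_prob(20))
```

Because the heuristic scores a whole run rather than individual bases, a single weak base inside an otherwise good region is retained instead of splitting the read.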
