Alignment of biological sequences with quality scores.

Author information

Na Joong Chae, Roh Kangho, Apostolico Alberto, Park Kunsoo

Affiliation

Department of Computer Engineering, Sejong University, Seoul 143-747, South Korea.

Publication information

Int J Bioinform Res Appl. 2009;5(1):97-113. doi: 10.1504/IJBRA.2009.022466.

Abstract

In this paper, we consider the problem of sequence alignment with quality scores. DNA sequences produced by a base-calling program (as part of sequencing) carry quality scores that represent the confidence level for each individual base. However, previous sequence alignment algorithms do not take these quality scores into account. To solve sequence alignment with quality scores, we first consider a more general problem in which the input consists of weighted sequences, that is, sequences in which each position carries the probabilities with which the characters occur there. We propose a meaningful measure of an alignment of two weighted sequences and show that an optimal alignment under this measure can be found by dynamic programming. Sequence alignment with quality scores can then be solved as a special case of the weighted sequence alignment problem.
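The abstract does not state the alignment measure itself, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes that a Phred-style quality score Q corresponds to an error probability of 10^(-Q/10), that the error mass is spread uniformly over the other three bases, and that a column is scored by its expected match/mismatch value with a linear gap penalty. All function names and scoring parameters are hypothetical.

```python
from typing import Dict, List

ALPHABET = "ACGT"
Dist = Dict[str, float]  # one position of a weighted sequence: base -> probability


def phred_to_dist(base: str, q: int) -> Dist:
    """Turn a called base and its quality score into a probability distribution.

    Assumption: error probability is 10^(-Q/10), split evenly over the
    three bases that were not called.
    """
    p_err = 10 ** (-q / 10)
    return {b: (1.0 - p_err) if b == base else p_err / 3.0 for b in ALPHABET}


def from_read(seq: str, quals: List[int]) -> List[Dist]:
    """Build a weighted sequence from a read and its per-base quality scores."""
    return [phred_to_dist(b, q) for b, q in zip(seq, quals)]


def expected_score(x: Dist, y: Dist, match: float = 1.0, mismatch: float = -1.0) -> float:
    """Expected substitution score of one column (assumed measure)."""
    p_same = sum(x[b] * y[b] for b in ALPHABET)
    return match * p_same + mismatch * (1.0 - p_same)


def align_weighted(xs: List[Dist], ys: List[Dist], gap: float = -1.0) -> float:
    """Global alignment score of two weighted sequences by dynamic programming."""
    n, m = len(xs), len(ys)
    dp = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        dp[i][0] = i * gap
    for j in range(1, m + 1):
        dp[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            dp[i][j] = max(
                dp[i - 1][j - 1] + expected_score(xs[i - 1], ys[j - 1]),
                dp[i - 1][j] + gap,  # gap in ys
                dp[i][j - 1] + gap,  # gap in xs
            )
    return dp[n][m]
```

Under these assumptions, two identical reads with high quality scores obtain a score close to the plain match count, while the same reads with low quality scores obtain a lower score, because each column is less certain to be a true match.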
