Suppr超能文献

基于片段对片段比较的多DNA和蛋白质序列比对。

Multiple DNA and protein sequence alignment based on segment-to-segment comparison.

作者信息

Morgenstern B, Dress A, Werner T

机构信息

National Research Center for Environment and Health, Institute of Mammalian Genetics, Neuherberg, Germany.

出版信息

Proc Natl Acad Sci U S A. 1996 Oct 29;93(22):12098-103. doi: 10.1073/pnas.93.22.12098.

Abstract

In this paper, a new way to think about, and to construct, pairwise as well as multiple alignments of DNA and protein sequences is proposed. Rather than forcing alignments to either align single residues or to introduce gaps by defining an alignment as a path running right from the source up to the sink in the associated dot-matrix diagram, we propose to consider alignments as consistent equivalence relations defined on the set of all positions occurring in all sequences under consideration. We also propose constructing alignments from whole segments exhibiting highly significant overall similarity rather than by aligning individual residues. Consequently, we present an alignment algorithm that (i) is based on segment-to-segment comparison instead of the commonly used residue-to-residue comparison and which (ii) avoids the well-known difficulties concerning the choice of appropriate gap penalties: gaps are not treated explicity, but remain as those parts of the sequences that do not belong to any of the aligned segments. Finally, we discuss the application of our algorithm to two test examples and compare it with commonly used alignment methods. As a first example, we aligned a set of 11 DNA sequences coding for functional helix-loop-helix proteins. Though the sequences show only low overall similarity, our program correctly aligned all of the 11 functional sites, which was a unique result among the methods tested. As a by-product, the reading frames of the sequences were identified. Next, we aligned a set of ribonuclease H proteins and compared our results with alignments produced by other programs as reported by McClure et al. [McClure, M. A., Vasi, T. K. & Fitch, W. M. (1994) Mol. Biol. Evol. 11, 571-592]. Our program was one of the best scoring programs. However, in contrast to other methods, our protein alignments are independent of user-defined parameters.

摘要

本文提出了一种思考和构建DNA及蛋白质序列两两比对和多序列比对的新方法。我们不再强制比对要么对齐单个残基,要么通过将比对定义为关联点阵图中从起点到终点的路径来引入空位,而是建议将比对视为在所有考虑序列中出现的所有位置集合上定义的一致等价关系。我们还建议从整体上具有高度显著相似性的片段构建比对,而不是通过对齐单个残基来构建。因此,我们提出了一种比对算法,该算法:(i)基于片段与片段的比较而非常用的残基与残基的比较;(ii)避免了与选择合适空位罚分相关的众所周知的困难:空位不被明确处理,而是保留为序列中不属于任何已比对片段的那些部分。最后,我们讨论了我们的算法在两个测试示例中的应用,并将其与常用的比对方法进行了比较。作为第一个示例,我们对齐了一组编码功能性螺旋-环-螺旋蛋白的11个DNA序列。尽管这些序列整体相似性较低,但我们的程序正确对齐了所有11个功能位点,这在测试的方法中是独一无二的结果。作为副产品,还识别出了这些序列的阅读框。接下来,我们对齐了一组核糖核酸酶H蛋白,并将我们的结果与McClure等人[McClure, M. A., Vasi, T. K. & Fitch, W. M. (1994) Mol. Biol. Evol. 11, 571 - 592]报道的其他程序产生的比对结果进行了比较。我们的程序是得分最高的程序之一。然而,与其他方法不同的是,我们的蛋白质比对独立于用户定义的参数。

相似文献

2
Using CLUSTAL for multiple sequence alignments.使用CLUSTAL进行多序列比对。
Methods Enzymol. 1996;266:383-402. doi: 10.1016/s0076-6879(96)66024-8.
7

引用本文的文献

2
An overview of technologies for MS-based proteomics-centric multi-omics.基于 MS 的蛋白质组学中心型多组学技术概述。
Expert Rev Proteomics. 2022 Mar;19(3):165-181. doi: 10.1080/14789450.2022.2070476. Epub 2022 May 2.

本文引用的文献

1
Motif-biased protein sequence alignment.基序偏好性蛋白质序列比对
J Comput Biol. 1994 Winter;1(4):297-310. doi: 10.1089/cmb.1994.1.297.
5
Comparative analysis of multiple protein-sequence alignment methods.多种蛋白质序列比对方法的比较分析
Mol Biol Evol. 1994 Jul;11(4):571-92. doi: 10.1093/oxfordjournals.molbev.a040138.
6
Similarities between protein 3-D structures.蛋白质三维结构之间的相似性。
Protein Eng. 1994 Oct;7(10):1175-87. doi: 10.1093/protein/7.10.1175.
8
Simultaneous comparison of three protein sequences.三种蛋白质序列的同步比较。
Proc Natl Acad Sci U S A. 1985 May;82(10):3073-7. doi: 10.1073/pnas.82.10.3073.
9
A flexible multiple sequence alignment program.一个灵活的多序列比对程序。
Nucleic Acids Res. 1988 Mar 11;16(5):1683-91. doi: 10.1093/nar/16.5.1683.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验