Smith H O, Annau T M, Chandrasegaran S
Department of Molecular Biology and Genetics, School of Medicine, Johns Hopkins University, Baltimore, MD 21205.
Proc Natl Acad Sci U S A. 1990 Jan;87(2):826-30. doi: 10.1073/pnas.87.2.826.
We have developed a method for rapidly finding patterns of conserved amino acid residues (motifs) in groups of functionally related proteins. All 3-amino acid patterns in a group of proteins of the type aa1 d1 aa2 d2 aa3, where d1 and d2 are distances that can be varied in a range up to 24 residues, are accumulated into an array. Segments of the proteins containing those patterns that occur most frequently are aligned on each other by a scoring method that obtains an average relatedness value for all the amino acids in each column of the aligned sequence block based on the Dayhoff relatedness odds matrix. The automated method successfully finds and displays nearly all of the sequence motifs that have been previously reported to occur in 33 reverse transcriptases, 18 DNA integrases, and 30 DNA methyltransferases.
我们已经开发出一种方法,用于快速在功能相关的蛋白质组中找到保守氨基酸残基模式(基序)。对于一组aa1 d1 aa2 d2 aa3类型的蛋白质中的所有三氨基酸模式(其中d1和d2是可在多达24个残基范围内变化的距离),会累积到一个数组中。通过一种评分方法,将包含那些最频繁出现模式的蛋白质片段相互比对,该评分方法基于Dayhoff相关性几率矩阵,为比对序列块的每一列中的所有氨基酸获得一个平均相关性值。该自动化方法成功找到了并展示了几乎所有先前报道出现在33种逆转录酶、18种DNA整合酶和30种DNA甲基转移酶中的序列基序。