Schaeffer E, Sninsky J J
Proc Natl Acad Sci U S A. 1984 May;81(9):2902-6. doi: 10.1073/pnas.81.9.2902.
Proteins that are related evolutionarily may have diverged at the level of primary amino acid sequence while maintaining similar secondary structures. Computer analysis has been used to compare the open reading frames of the hepatitis B virus to those of the woodchuck hepatitis virus at the level of amino acid sequence, and to predict the relative hydrophilic character and the secondary structure of putative polypeptides. Similarity is seen at the levels of relative hydrophilicity and secondary structure, in the absence of sequence homology. These data reinforce the proposal that these open reading frames encode viral proteins. Computer analysis of this type can be more generally used to establish structural similarities between proteins that do not share obvious sequence homology as well as to assess whether an open reading frame is fortuitous or codes for a protein.
在进化上相关的蛋白质可能在一级氨基酸序列水平上发生了分化,同时保持相似的二级结构。计算机分析已被用于在氨基酸序列水平上比较乙型肝炎病毒的开放阅读框与土拨鼠肝炎病毒的开放阅读框,并预测推定多肽的相对亲水性特征和二级结构。在没有序列同源性的情况下,在相对亲水性和二级结构水平上可以看到相似性。这些数据支持了这些开放阅读框编码病毒蛋白的提议。这种类型的计算机分析可以更广泛地用于建立不具有明显序列同源性的蛋白质之间的结构相似性,以及评估一个开放阅读框是偶然出现的还是编码一种蛋白质。