Lipman D J, Smith T F, Beckman R J, Waterman M S
Nucleic Acids Res. 1982 Sep 11;10(17):5375-89. doi: 10.1093/nar/10.17.5375.
Five recently sequenced hemagglutinin genes from Influenza A virus strains are studied for similarities in a hierarchical fashion. The sequences are compared for similarity, first on the level of sequence homology, and then on several progressively more general levels. Though the HA1 subsequences contain regions where homology drops to that of a Monte Carlo generated reference value, subsequent tests reveal great similarity due to constraints on the level of amino acid sequence. Other tests detect statistically significant differences between subtypes due to constraints acting below the level of amino acid sequence, such as the secondary structure of the viral RNA, or involving translation of the mRNA. The general applicability of the hierarchical approach to sequence analysis is discussed.
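The abstract refers to a "Monte Carlo generated reference value" for homology without describing how it was computed. As an illustration only, one common way to obtain such a baseline is to shuffle one sequence many times (preserving its composition) and record the identity expected by chance. The sketch below assumes pre-aligned, equal-length sequences with no gaps; the function names, the shuffling scheme, and the toy sequences are hypothetical and are not taken from the paper.

```python
import random


def percent_identity(a: str, b: str) -> float:
    """Fraction of positions at which two equal-length sequences match."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    return sum(x == y for x, y in zip(a, b)) / len(a)


def monte_carlo_reference(a: str, b: str, trials: int = 1000, seed: int = 0):
    """Estimate chance-level identity by shuffling b (assumed null model).

    Returns (mean identity, 95th-percentile identity) over the shuffles.
    Shuffling keeps the base composition of b but destroys its order.
    """
    rng = random.Random(seed)
    chars = list(b)
    scores = []
    for _ in range(trials):
        rng.shuffle(chars)
        scores.append(percent_identity(a, "".join(chars)))
    scores.sort()
    mean = sum(scores) / trials
    p95 = scores[int(0.95 * (trials - 1))]
    return mean, p95


if __name__ == "__main__":
    # Toy sequences, not real hemagglutinin data.
    seq_a = "ACGT" * 10
    seq_b = "ACGA" * 10
    observed = percent_identity(seq_a, seq_b)
    mean, p95 = monte_carlo_reference(seq_a, seq_b)
    print(f"observed identity: {observed:.2f}")
    print(f"chance-level mean: {mean:.2f}, 95th percentile: {p95:.2f}")
```

Under this assumed null model, a region whose observed identity falls at or below the shuffled mean would be described as having homology no better than the reference value; applying the same comparison in sliding windows would locate such regions along a gene.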