Auger I E, Lawrence C E
Bull Math Biol. 1989;51(1):39-54. doi: 10.1007/BF02458835.
Two algorithms for the efficient identification of segment neighborhoods are presented. A segment neighborhood is a set of contiguous residues that share common features. Two procedures are developed to efficiently find estimates for the parameters of the model that describe these features and for the residues that define the boundaries of each segment neighborhood. The algorithms can accept nearly any model of segment neighborhood, and can be applied with a broad class of best fit functions including least squares and maximum likelihood. The algorithms successively identify the most important features of the sequence. The application of one of these methods to the haemagglutinin protein of influenza virus reveals a possible mechanism for conformational change through the finding of a break in a strong heptad repeat structure.
本文提出了两种用于高效识别片段邻域的算法。片段邻域是一组具有共同特征的连续残基。开发了两种程序,以有效地找到描述这些特征的模型参数估计值,以及定义每个片段邻域边界的残基估计值。这些算法几乎可以接受任何片段邻域模型,并可与包括最小二乘法和最大似然法在内的广泛的最佳拟合函数一起应用。这些算法依次识别序列中最重要的特征。将其中一种方法应用于流感病毒的血凝素蛋白,通过在强七肽重复结构中发现一个断点,揭示了一种构象变化的可能机制。