Galitsky B A, Gelfand I M, Kister A E
Department of Mathematics, Rutgers University, Piscataway, NJ 08854, USA.
Proc Natl Acad Sci U S A. 1998 Apr 28;95(9):5193-8. doi: 10.1073/pnas.95.9.5193.
A new method for classification of Ig sequences is suggested. The defining characteristic of a class is presence of particular residues at several class-determining positions. Sequences within a class follow the same amino acid pattern, i.e., residues at identical positions are, in an overwhelming majority of sequences of that class, identical or chemically related. Thus, once the class of a sequence is determined, one can predict the residue(s) at almost any position in the sequence. In this paper, results of analysis of 1,172 human heavy chains are presented. It was shown that a sequence can be assigned to one of six classes depending on which residues are found at its positions 1, 3, 5, 6, 7, 9, 10, 12, and 13. It is important to note that it is possible to achieve same six-class classification of the human heavy chains on the basis of a different set of positions found not at the beginning but near the end of the sequence (around position 80). For every class, an amino acid pattern of an entire sequence (complementarity determining regions excepting) has been determined. Our approach allowed us to reconstruct the incomplete human heavy chains in which residues at certain positions at the beginning or end of the chain are known. We developed a software tool for analysis, classification, and prediction of residues in sequences of the Ig family.
本文提出了一种免疫球蛋白(Ig)序列分类的新方法。一类的定义特征是在几个类别决定位置存在特定残基。同一类中的序列遵循相同的氨基酸模式,即,在该类的绝大多数序列中,相同位置的残基是相同的或化学相关的。因此,一旦确定了一个序列的类别,就可以预测该序列几乎任何位置的残基。本文展示了对1172条人类重链的分析结果。结果表明,根据序列第1、3、5、6、7、9、10、12和13位发现的残基,一个序列可以被归为六个类别之一。需要注意的是,基于序列末尾(约80位附近)发现的另一组不同位置,也有可能实现人类重链的相同六类分类。对于每一类,已经确定了整个序列(不包括互补决定区)的氨基酸模式。我们的方法使我们能够重建不完整的人类重链,其中链起始或末尾某些位置的残基是已知的。我们开发了一种软件工具,用于分析、分类和预测Ig家族序列中的残基。