Bachinskiĭ A G, Iarygin A A, Kulichkov V A, Guseva E G
Mol Biol (Mosk). 1995 Jul-Aug;29(4):907-17.
A bank of images of protein families (the PROF IMAGE bank) was developed using a previously published method and the amino acid sequences of release 24 of the SWISS-PROT bank (27,752 sequences). The method relies on the physicochemical and structural properties of amino acids and on selecting the fragments of each protein family that best discriminate the family's amino acid sequences from representatives of other families and from random sequences. The algorithms for building the images, the principles for selecting amino acid sequences to form protein families, the structure of the bank, and the characteristics of the images of 163 protein families are described. The data are illustrated by the image of the alpha-interferon precursor family. The results of comparing the images with all proteins in the SWISS-PROT bank, and ways of using the PROF IMAGE bank to determine possible functions of amino acid sequences, are discussed.
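The abstract does not give the image-building algorithm itself, so the following is only a minimal sketch of the general idea (a per-position physicochemical profile built from aligned family fragments, then slid along a query sequence), not the authors' method. It assumes a single property, the Kyte-Doolittle hydropathy scale, as a stand-in for the physicochemical and structural properties mentioned above; the function names `build_image`, `score_window`, and `best_match`, the spread floor of 0.5, and the toy fragments are all hypothetical.

```python
from statistics import mean, pstdev

# Kyte-Doolittle hydropathy scale, used here as ONE stand-in property (assumption).
HYDROPATHY = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def build_image(fragments):
    """Return per-position (mean, spread) of the property over aligned fragments."""
    length = len(fragments[0])
    if any(len(f) != length for f in fragments):
        raise ValueError("fragments must all have the same length")
    image = []
    for i in range(length):
        values = [HYDROPATHY[f[i]] for f in fragments]
        # Floor the spread (hypothetical value) so conserved positions
        # do not cause a division by zero when scoring.
        image.append((mean(values), max(pstdev(values), 0.5)))
    return image


def score_window(image, window):
    """Mean squared z-distance between a window and the image; lower is closer."""
    return mean(
        ((HYDROPATHY[aa] - mu) / sd) ** 2 for aa, (mu, sd) in zip(window, image)
    )


def best_match(image, sequence):
    """Slide the image along a sequence; return (offset, score) of the closest window."""
    width = len(image)
    if len(sequence) < width:
        raise ValueError("sequence is shorter than the image")
    scored = [
        (score_window(image, sequence[i:i + width]), i)
        for i in range(len(sequence) - width + 1)
    ]
    score, offset = min(scored)
    return offset, score


if __name__ == "__main__":
    # Toy, made-up fragments standing in for one family's selected region.
    family_image = build_image(["LIVAL", "LVVAL", "IIVAL"])
    print(best_match(family_image, "DDDDLIVALKKK"))
```

A real discriminating image would also be calibrated against other families and random sequences, as the abstract describes; this sketch only scores similarity to one family and leaves that thresholding step out.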