Gong Haipeng, Rose George D
Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, Maryland 21218-2608, USA.
Proteins. 2005 Nov 1;61(2):338-43. doi: 10.1002/prot.20622.
Is highly approximate knowledge of a protein's backbone structure sufficient to successfully identify its family, superfamily, and tertiary fold? To explore this question, backbone dihedral angles were extracted from the known three-dimensional structure of 2,439 proteins and mapped into 36 labeled, 60 degrees x 60 degrees bins, called mesostates. Using this coarse-grained mapping, protein conformation can be approximated by a linear sequence of mesostates. These linear strings can then be aligned and assessed by conventional sequence-comparison methods. We report that the mesostate sequence is sufficient to recognize a protein's family, superfamily, and fold with good fidelity.
关于蛋白质主链结构的高度近似知识是否足以成功识别其家族、超家族和三级折叠结构?为了探究这个问题,从2439种蛋白质的已知三维结构中提取主链二面角,并将其映射到36个标记的、60度×60度的箱中,即所谓的中间状态。利用这种粗粒度映射,蛋白质构象可以用中间状态的线性序列来近似。然后可以通过传统的序列比较方法对这些线性字符串进行比对和评估。我们报告称,中间状态序列足以以较高的保真度识别蛋白质的家族、超家族和折叠结构。