He Yi, Rackovsky S, Yin Yanping, Scheraga Harold A
Department of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14853; and.
Department of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14853; and Department of Pharmacology and Systems Therapeutics, The Icahn School of Medicine at Mount Sinai, New York, NY 10029.
Proc Natl Acad Sci U S A. 2015 Apr 21;112(16):5029-32. doi: 10.1073/pnas.1504806112. Epub 2015 Apr 6.
The relationship between protein sequence and structure arises entirely from amino acid physical properties. An alternative method is therefore proposed to identify homologs in which residue equivalence is based exclusively on the pairwise physical property similarities of sequences. This approach, the property factor method (PFM), is entirely different from those in current use. A comparison is made between our method and PSI BLAST. We demonstrate that traditionally defined sequence similarity can be very low for pairs of sequences (which therefore cannot be identified using PSI BLAST), but similarity of physical property distributions results in almost identical 3D structures. The performance of PFM is shown to be better than that of PSI BLAST when sequence matching is comparable, based on a comparison using targets from CASP10 (89 targets) and CASP11 (51 targets). It is also shown that PFM outperforms PSI BLAST in informatically challenging targets.
蛋白质序列与结构之间的关系完全源于氨基酸的物理性质。因此,本文提出了另一种识别同源物的方法,其中残基等价性完全基于序列的成对物理性质相似性。这种方法,即性质因子法(PFM),与目前使用的方法完全不同。我们将我们的方法与PSI BLAST进行了比较。我们证明,对于成对的序列,传统定义的序列相似性可能非常低(因此无法使用PSI BLAST识别),但物理性质分布的相似性会导致几乎相同的三维结构。基于对来自CASP10(89个目标)和CASP11(51个目标)的目标进行比较,结果表明,当序列匹配具有可比性时,PFM的性能优于PSI BLAST。研究还表明,在信息学挑战性目标方面,PFM的表现优于PSI BLAST。