Gao Jun, Liu Qi, Kang Hong, Cao Zhiwei, Zhu Ruixin
College of Life Science and Biotechnology, Tongji University, Shanghai 200092, China.
College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China.
Int J Mol Sci. 2012;13(7):8752-8761. doi: 10.3390/ijms13078752. Epub 2012 Jul 16.
In recent years, although many ligand-binding site prediction methods have been developed, there has still been a great demand to improve the prediction accuracy and compare different prediction algorithms to evaluate their performances. In this work, in order to improve the performance of the protein-ligand binding site prediction method presented in our former study, a comparison of different binding site ranking lists was studied. Four kinds of properties, i.e., pocket size, distance from the protein centroid, sequence conservation and the number of hydrophobic residues, have been chosen as the corresponding ranking criterion respectively. Our studies show that the sequence conservation information helps to rank the real pockets with the most successful accuracy compared to others. At the same time, the pocket size and the distance of binding site from the protein centroid are also found to be helpful. In addition, a multi-view ranking aggregation method, which combines the information among those four properties, was further applied in our study. The results show that a better performance can be achieved by the aggregation of the complementary properties in the prediction of ligand-binding sites.
近年来,尽管已经开发了许多配体结合位点预测方法,但提高预测准确性并比较不同预测算法以评估其性能的需求仍然很大。在这项工作中,为了提高我们之前研究中提出的蛋白质-配体结合位点预测方法的性能,我们研究了不同结合位点排名列表的比较。分别选择了四种属性,即口袋大小、与蛋白质质心的距离、序列保守性和疏水残基数量,作为相应的排名标准。我们的研究表明,与其他属性相比,序列保守性信息有助于以最高的成功率对真实口袋进行排名。同时,还发现口袋大小和结合位点与蛋白质质心的距离也有帮助。此外,我们的研究进一步应用了一种多视图排名聚合方法,该方法结合了这四种属性之间的信息。结果表明,通过在配体结合位点预测中聚合互补属性可以实现更好的性能。