Department of Bioinformatics, Institute of Biomedical Chemistry, Moscow 119121, Russia.
Department of Bioinformatics, Russian National Research Medical University, Moscow 117997, Russia.
Int J Mol Sci. 2020 Oct 31;21(21):8152. doi: 10.3390/ijms21218152.
Computationally predicting the interaction of proteins and ligands presents three main directions: the search of new target proteins for ligands, the search of new ligands for targets, and predicting the interaction of new proteins and new ligands. We proposed an approach providing the fuzzy classification of protein sequences based on the ligand structural features to analyze the latter most complicated case. We tested our approach on five protein groups, which represented promised targets for drug-like ligands and differed in functional peculiarities. The training sets were built with the original procedure overcoming the data ambiguity. Our study showed the effective prediction of new targets for ligands with an average accuracy of 0.96. The prediction of new ligands for targets displayed the average accuracy 0.95; accuracy estimates were close to our previous results, comparable in accuracy to those of other methods or exceeded them. Using the fuzzy coefficients reflecting the target-to-ligand specificity, we provided predicting interactions for new proteins and new ligands; the obtained accuracy values from 0.89 to 0.99 were acceptable for such a sophisticated task. The protein kinase family case demonstrated the ability to account for subtle features of proteins and ligands required for the specificity of protein-ligand interaction.
为配体寻找新的靶蛋白,为靶标寻找新的配体,以及预测新的蛋白质和新的配体的相互作用。我们提出了一种方法,提供基于配体结构特征的蛋白质序列的模糊分类,以分析后一种最复杂的情况。我们在五个蛋白质组上测试了我们的方法,这些蛋白质组代表了有希望的药物样配体靶标,并且在功能上有特点。训练集是通过克服数据模糊性的原始程序构建的。我们的研究表明,对配体的新靶标进行了有效预测,平均准确率为 0.96。对靶标新配体的预测平均准确率为 0.95;准确性估计与我们以前的结果接近,与其他方法的准确性相当或超过了其他方法。使用反映靶标 - 配体特异性的模糊系数,我们提供了新的蛋白质和新的配体的相互作用预测;从 0.89 到 0.99 的获得的准确性值对于这样一个复杂的任务是可以接受的。蛋白激酶家族的案例证明了能够解释蛋白质和配体所需的细微特征的能力,这些特征是蛋白质 - 配体相互作用特异性所必需的。