Wang Ying, Wang Lei, Wong Leon, Zhao Bowei, Su Xiaorui, Li Yang, You Zhuhong
College of Information Science and Engineering, Zaozhuang University, Zaozhuang 277160, China.
Big Data and Intelligent Computing Research Center, Guangxi Academy of Sciences, Nanning 530007, China.
Biology (Basel). 2022 May 13;11(5):741. doi: 10.3390/biology11050741.
As the basis for screening drug candidates, the identification of drug-target interactions (DTIs) plays a crucial role in the innovative drugs research. However, due to the inherent constraints of small-scale and time-consuming wet experiments, DTI recognition is usually difficult to carry out. In the present study, we developed a computational approach called RoFDT to predict DTIs by combining feature-weighted Rotation Forest (FwRF) with a protein sequence. In particular, we first encode protein sequences as numerical matrices by Position-Specific Score Matrix (PSSM), then extract their features utilize Pseudo Position-Specific Score Matrix (PsePSSM) and combine them with drug structure information-molecular fingerprints and finally feed them into the FwRF classifier and validate the performance of RoFDT on Enzyme, GPCR, Ion Channel and Nuclear Receptor datasets. In the above dataset, RoFDT achieved 91.68%, 84.72%, 88.11% and 78.33% accuracy, respectively. RoFDT shows excellent performance in comparison with support vector machine models and previous superior approaches. Furthermore, 7 of the top 10 DTIs with RoFDT estimate scores were proven by the relevant database. These results demonstrate that RoFDT can be employed to a powerful predictive approach for DTIs to provide theoretical support for innovative drug discovery.
作为筛选候选药物的基础,药物-靶点相互作用(DTIs)的识别在创新药物研究中起着至关重要的作用。然而,由于小规模和耗时的湿实验的固有局限性,DTIs识别通常难以进行。在本研究中,我们开发了一种名为RoFDT的计算方法,通过将特征加权旋转森林(FwRF)与蛋白质序列相结合来预测DTIs。具体而言,我们首先通过位置特异性得分矩阵(PSSM)将蛋白质序列编码为数值矩阵,然后利用伪位置特异性得分矩阵(PsePSSM)提取其特征,并将它们与药物结构信息——分子指纹相结合,最后将它们输入到FwRF分类器中,并在酶、GPCR、离子通道和核受体数据集上验证RoFDT的性能。在上述数据集中,RoFDT的准确率分别达到了91.68%、84.72%、88.11%和78.33%。与支持向量机模型和先前的优秀方法相比,RoFDT表现出优异的性能。此外,RoFDT估计得分最高的前10个DTIs中有7个得到了相关数据库的证实。这些结果表明,RoFDT可作为一种强大的DTIs预测方法,为创新药物发现提供理论支持。