Mamada Hideaki, Nomura Yukihiro, Uesawa Yoshihiro
Department of Medical Molecular Informatics, Meiji Pharmaceutical University, 2-522-1, Noshio, Kiyose, Tokyo 204-8588, Japan.
Drug Metabolism and Pharmacokinetics Research Laboratories, Central Pharmaceutical Research Institute, Japan Tobacco Inc., 1-1, Murasaki-cho, Takatsuki, Osaka 569-1125, Japan.
ACS Omega. 2022 May 11;7(20):17055-17062. doi: 10.1021/acsomega.2c00261. eCollection 2022 May 24.
The toxicity, absorption, distribution, metabolism, and excretion properties of some targets are difficult to predict by quantitative structure-activity relationship analysis. Therefore, there is a need for a new prediction method that performs well for these targets. The aim of this study was to develop a new regression model of rat clearance (CL). We constructed a regression model using 1545 in-house compounds for which we had rat CL data. Molecular descriptors were calculated using molecular operating environment, alvaDesc, and ADMET Predictor software. The classification model of DeepSnap and Deep Learning (DeepSnap-DL) with images of the three-dimensional chemical structures of compounds as features was constructed, and the prediction probabilities for each compound were calculated. For molecular descriptor-based methods that use molecular descriptors and conventional machine learning algorithms selected by DataRobot, the correlation coefficient ( ) and root mean square error (RMSE) were 0.625-0.669 and 0.295-0.318, respectively. We combined molecular descriptors and prediction probability of DeepSnap-DL as features and developed a novel regression method we called the combination model. In the combination model with these two types of features and conventional algorithms selected by DataRobot, and RMSE were 0.710-0.769 and 0.247-0.278, respectively. This finding shows that the combination model performed better than molecular descriptor-based methods. Our combination model will contribute to the design of more rational compounds for drug discovery. This method may be applicable not only to rat CL but also to other pharmacokinetic and pharmacological activity and toxicity parameters; therefore, applying it to other parameters may help to accelerate drug discovery.
某些靶点的毒性、吸收、分布、代谢和排泄特性难以通过定量构效关系分析来预测。因此,需要一种对这些靶点表现良好的新预测方法。本研究的目的是开发一种新的大鼠清除率(CL)回归模型。我们使用1545种有大鼠CL数据的内部化合物构建了一个回归模型。使用分子操作环境、alvaDesc和ADMET Predictor软件计算分子描述符。构建了以化合物三维化学结构图像为特征的DeepSnap和深度学习(DeepSnap-DL)分类模型,并计算了每种化合物的预测概率。对于使用分子描述符和由DataRobot选择的传统机器学习算法的基于分子描述符的方法,相关系数( )和均方根误差(RMSE)分别为0.625 - 0.669和0.295 - 0.318。我们将分子描述符和DeepSnap-DL的预测概率作为特征进行组合,开发了一种名为组合模型的新型回归方法。在具有这两种类型特征和由DataRobot选择的传统算法的组合模型中, 和RMSE分别为0.710 - 0.769和0.247 - 0.278。这一发现表明组合模型比基于分子描述符的方法表现更好。我们的组合模型将有助于药物发现中更合理化合物的设计。这种方法不仅可能适用于大鼠CL,还可能适用于其他药代动力学、药理活性和毒性参数;因此,将其应用于其他参数可能有助于加速药物发现。