Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, China.
J Appl Toxicol. 2022 Oct;42(10):1639-1650. doi: 10.1002/jat.4331. Epub 2022 Apr 29.
In recent years, drug-induced nephrotoxicity has been one of the main reasons for the failure of drug development. Early prediction of the nephrotoxicity for drug candidates is critical to the success of clinical trials. Therefore, it is very important to construct an effective model that can predict the potential nephrotoxicity of compounds. Machine learning methods have been widely used to predict the physicochemical properties, biological activities, and safety assessment of compounds. In this study, we manually collected 777 valid drug data and constructed a total of 72 classification models using nine types of molecular fingerprints combined with different machine learning algorithms. From experimental literature and the US FDA Drugs Database, some marketed drugs were screened for external validation of the models. Finally, three models exhibited good performance in the prediction of nephrotoxicity of both chemical drugs and Chinese herbal medicines. The best model was the support vector machine algorithm combined with CDK graph only fingerprint. Furthermore, the applicability domain of the models was analyzed according to the OECD principles, and we also used the SARpy and information gain methods to find eight substructures that might cause nephrotoxicity, so as to attract attention in the future drug discovery.
近年来,药物诱导的肾毒性是药物开发失败的主要原因之一。早期预测候选药物的肾毒性对于临床试验的成功至关重要。因此,构建一个能够预测化合物潜在肾毒性的有效模型非常重要。机器学习方法已广泛用于预测化合物的物理化学性质、生物活性和安全性评估。在这项研究中,我们手动收集了 777 条有效药物数据,并使用九种分子指纹图结合不同的机器学习算法构建了总共 72 个分类模型。从实验文献和美国 FDA 药物数据库中筛选了一些上市药物,对模型进行外部验证。最后,三种模型在化学药物和中药的肾毒性预测方面表现出良好的性能。最好的模型是仅结合 CDK 图指纹的支持向量机算法。此外,根据 OECD 原则分析了模型的适用域,我们还使用 SARpy 和信息增益方法找到了可能导致肾毒性的八个亚结构,以便在未来的药物发现中引起关注。