Xiangya School of Pharmaceutical Sciences , Central South University , Changsha , People's Republic of China.
Hunan Key Laboratory of Processed Food for Special Medical Purpose Central South University of Forestry and Technology , Changsha 410004 , People's Republic of China.
Mol Pharm. 2019 Jan 7;16(1):393-408. doi: 10.1021/acs.molpharmaceut.8b01048. Epub 2018 Dec 14.
Hepatotoxicity is a major cause of drug withdrawal from the market. To reduce the drug attrition induced by hepatotoxicity, an accurate and efficient hepatotoxicity prediction system must be constructed. In the present study, we constructed a three-level hepatotoxicity prediction system based on different levels of adverse hepatic effects (AHEs) combined with machine learning, using (1) an end point, hepatotoxicity; (2) four hepatotoxicity severity degrees; and (3) specific AHEs. After collecting and curing 15 873 compound-AHE pairs associated with 2017 compounds and 403 AHEs, we constructed 27 models with three end point levels with the random forest algorithm, and obtained accuracies ranging from 67.0 to 78.2% and the area under receiver operating characteristic curves (AUCs) of 0.715-0.875. The 27 models were fully integrated into a tiered hepatotoxicity prediction system. The existence of hepatotoxicity existence, severity degree, and potential AHEs for a given compound could be inferred simultaneously and systematically. Thus, the tiered hepatotoxicity prediction system allows researchers to have significant confidence in confirming compound hepatotoxicity, analyzing hepatotoxicity from multiple perspectives, obtaining warnings for the potential hepatotoxicity severity, and even rapidly selecting the proper in vitro experiments for hepatotoxicity verification. We also applied three external sets (11 drugs or candidates that failed in clinical trials or were withdrawn from the market, the PharmGKB (offsides) database, and an herbal hepatotoxicity data set) to test and validate the prediction ability of our system. Furthermore, the hepatotoxicity prediction system was adapted into a flow framework based on the Konstanz Information Miner, which was made available for researchers.
肝毒性是导致药物退出市场的主要原因。为了降低肝毒性导致的药物淘汰率,必须构建一个准确、高效的肝毒性预测系统。在本研究中,我们构建了一个基于不同水平的肝毒性不良反应(AHEs)的三级肝毒性预测系统,结合机器学习,使用(1)终点,肝毒性;(2)四种肝毒性严重程度;和(3)特定的 AHEs。在收集和治疗了 15873 对与 2017 种化合物和 403 种 AHEs 相关的化合物-AHE 对后,我们使用随机森林算法构建了 27 个具有三个终点水平的模型,获得了 67.0%至 78.2%的准确率和 0.715-0.875 的接收器操作特征曲线下面积(AUCs)。这 27 个模型被完全整合到一个分层肝毒性预测系统中。可以同时和系统地推断给定化合物是否存在肝毒性、严重程度和潜在的 AHEs。因此,分层肝毒性预测系统允许研究人员在确认化合物肝毒性、从多个角度分析肝毒性、获得潜在肝毒性严重程度的警告,甚至快速选择适当的体外实验进行肝毒性验证方面具有显著的信心。我们还应用了三个外部数据集(11 种在临床试验中失败或退出市场的药物或候选药物、PharmGKB(offside)数据库和一个草药肝毒性数据集)来测试和验证我们系统的预测能力。此外,我们还将肝毒性预测系统改编成了基于 Konstanz Information Miner 的流程框架,供研究人员使用。