Lu Jing, Zhang Pin, Zou Xiao-Wen, Zhao Xiao-Qiang, Cheng Ke-Guang, Zhao Yi-Lei, Bi Yi, Zheng Ming-Yue, Luo Xiao-Min
School of Pharmacy, Key Laboratory of Molecular Pharmacology and Drug Evaluation (Yantai University), Ministry of Education, Collaborative Innovation Center of Advanced Drug Delivery System and Biotech Drugs in Universities of Shandong, Yantai University, Yantai, 264005. China.
Key Laboratory for the Chemistry and Molecular Engineering of Medicinal Resources (Guangxi Normal University), Ministry of Education of China, Guilin 541004. China.
Comb Chem High Throughput Screen. 2017;20(4):346-353. doi: 10.2174/1386207320666170217151826.
Chemical toxicity is an important reason for late-stage failure in drug R&D. However, it is time-consuming and expensive to identify the multiple toxicities of compounds using the traditional experiments. Thus, it is attractive to build an accurate prediction model for the toxicity profile of compounds.
In this study, we carried out a research on six types of toxicities: (I) Acute Toxicity; (II) Mutagenicity; (III) Tumorigenicity; (IV) Skin and Eye Irritation; (V) Reproductive Effects; (VI) Multiple Dose Effects, using local lazy learning (LLL) method for multi-label learning. 17,120 compounds were split into the training set and the test set as a ratio of 4:1 by using the Kennard-Stone algorithm. Four types of properties, including molecular fingerprints (ECFP_4 and FCFP_4), descriptors, and chemical-chemical-interactions, were adopted for model building.
The model 'ECFP_4+LLL' yielded the best performance for the test set, while balanced accuracy (BACC) reached 0.692, 0.691, 0.666, 0.680, 0.631, 0.599 for six types of toxicities, respectively. Furthermore, some essential toxicophores for six types of toxicities were identified by using the Laplacian-modified Bayesian model.
The accurate prediction model and the chemical toxicophores can provide some guidance for designing drugs with low toxicity.
化学毒性是药物研发后期失败的一个重要原因。然而,使用传统实验来识别化合物的多种毒性既耗时又昂贵。因此,构建一个准确的化合物毒性特征预测模型很有吸引力。
在本研究中,我们使用局部懒惰学习(LLL)方法进行多标签学习,对六种类型的毒性进行了研究:(I)急性毒性;(II)致突变性;(III)致癌性;(IV)皮肤和眼睛刺激性;(V)生殖效应;(VI)多剂量效应。使用肯纳德-斯通算法将17120种化合物按4:1的比例划分为训练集和测试集。模型构建采用了四种类型的属性,包括分子指纹(ECFP_4和FCFP_4)、描述符和化学-化学相互作用。
模型“ECFP_4+LLL”在测试集上表现最佳,六种类型毒性的平衡准确率(BACC)分别达到0.692、0.691、0.666、0.680、0.631、0.599。此外,使用拉普拉斯修正贝叶斯模型确定了六种类型毒性的一些关键毒理学基团。
准确的预测模型和化学毒理学基团可为低毒性药物设计提供一些指导。