Ding Yi-Lung, Lyu You-Chen, Leong Max K
Department of Chemistry, National Dong Hwa University, Shoufeng, Hualien 97401, Taiwan.
Department of Chemistry, National Dong Hwa University, Shoufeng, Hualien 97401, Taiwan; Department of Life Science and Institute of Biotechnology, National Dong Hwa University, Shoufeng, Hualien 97401, Taiwan.
Toxicol In Vitro. 2017 Apr;40:102-114. doi: 10.1016/j.tiv.2016.12.013. Epub 2016 Dec 24.
Certain drugs are nitroaromatic compounds, which are potentially toxic. As such, it is of practical importance to assess and predict their mutagenic potency in the process of drug discovery. A classical quantitative structure-activity relationship (QSAR) model was developed using the linear partial least square (PLS) scheme to understand the underline mutagenic mechanism and a non-classical QSAR model was derived using the machine learning-based hierarchical support vector regression (HSVR) to predict the mutagenicity of nitroaromatic compounds based on a series of mutagenicity data (TA98-S9). It was observed that HSVR performed better than PLS as manifested by the predictions of the samples in the training set, test set, and outlier set as well as various statistical validations. A mock test designated to mimic real challenges also confirmed the better performance of HSVR. Furthermore, HSVR exhibited superiority in predictivity, generalization capabilities, consistent performance, and robustness when compared with various published predictive models. PLS, conversely, revealed some mechanistically interpretable relationships between descriptors and mutagenicity. Thus, this two-QSAR approach using the predictive HSVR and interpretable PLS models in a synergistic fashion can be adopted to facilitate drug discovery and development by designing safer drug candidates with nitroaromatic moiety.
某些药物是硝基芳香族化合物,具有潜在毒性。因此,在药物发现过程中评估和预测它们的致突变潜力具有实际重要性。使用线性偏最小二乘法(PLS)建立了一个经典的定量构效关系(QSAR)模型,以了解潜在的致突变机制,并基于一系列致突变性数据(TA98-S9),使用基于机器学习的分层支持向量回归(HSVR)推导了一个非经典的QSAR模型,以预测硝基芳香族化合物的致突变性。结果发现,从训练集、测试集和异常值集的样本预测以及各种统计验证来看,HSVR的表现优于PLS。一个旨在模拟实际挑战的模拟测试也证实了HSVR的更好性能。此外,与各种已发表的预测模型相比,HSVR在预测性能力、泛化能力、一致性能和稳健性方面表现出优势。相反,PLS揭示了描述符与致突变性之间一些可从机制上解释的关系。因此,这种以协同方式使用预测性HSVR和可解释PLS模型的双QSAR方法,可通过设计具有硝基芳香族部分的更安全候选药物,来促进药物发现和开发。