Ren Shijin
Center for Environmental Biotechnology, 676 Dabney Hall, University of Tennessee, Knoxville, Tennessee 37996-1605, USA.
J Chem Inf Comput Sci. 2003 Sep-Oct;43(5):1679-87. doi: 10.1021/ci034046y.
Response surface models based on multiple linear regression had previously been developed for the toxicity of aromatic chemicals to Tetrahymena pyriformis. However, a nonlinear relationship between toxicity and one of the molecular descriptors in the response surface model was observed. In this study, response surface models were established using six nonlinear modeling methods to handle the nonlinearity exhibited in the aromatic chemicals data set. All models were validated using the method of cross-validation, and prediction accuracy was tested on an external data set. Results showed that response surface models based on locally weighted regression scatter plot smoothing (LOESS), multivariate adaptive regression splines (MARS), neural networks (NN), and projection pursuit regression (PPR) provided satisfactory power of model fitting and prediction and had similar applicabilities. The response surface models based on nonlinear methods were difficult to interpret and conservative in discriminating toxicity mechanisms.
基于多元线性回归的响应面模型先前已针对芳香族化学品对梨形四膜虫的毒性进行了开发。然而,在响应面模型中观察到毒性与其中一个分子描述符之间存在非线性关系。在本研究中,使用六种非线性建模方法建立了响应面模型,以处理芳香族化学品数据集中呈现的非线性。所有模型均使用交叉验证方法进行验证,并在外部数据集上测试预测准确性。结果表明,基于局部加权回归散点图平滑法(LOESS)、多元自适应回归样条法(MARS)、神经网络(NN)和投影寻踪回归法(PPR)的响应面模型具有令人满意的模型拟合和预测能力,且具有相似的适用性。基于非线性方法的响应面模型难以解释,并且在区分毒性机制方面较为保守。