Huuskonen J
Department of Pharmacy, University of Helsinki, Finland.
Environ Toxicol Chem. 2001 Oct;20(10):2152-7.
A group contribution method based on atom-type electrotopological state indices for predicting the biodegradation of a diverse set of 241 organic chemicals is presented. Multiple linear regression and artificial neural networks were used to build the models using a training set of 172 compounds, for which the approximate time for ultimate biodegradation was estimated from the results of a survey of an expert panel. Derived models were validated by using a leave-25%-out method and against two test sets of 12 and 57 chemicals not included in the training set. The squared correlation coefficient (r2) for a linear model with 15 structural parameters was 0.76 for the training set and 0.68 for the test set of 12 molecules. The model predicted correctly the biodegradation of 48 chemicals in the test set of 57 molecules, for which biodegradability was presented as rapid or slow. The use of artificial neural networks gave better prediction for both test sets when the same set of parameters was tested as inputs in neural network simulations. The predictions of rapidly biodegradable chemicals were more accurate than the predictions of slowly biodegradable chemicals for both the regression and neural network models.
提出了一种基于原子类型电拓扑状态指数的基团贡献法,用于预测241种不同有机化学品的生物降解性。使用172种化合物的训练集,通过多元线性回归和人工神经网络建立模型,其中最终生物降解的近似时间是根据专家小组的调查结果估算得出的。通过留25%法并针对训练集中未包含的两组分别为12种和57种化学品的测试集对所得模型进行验证。具有15个结构参数的线性模型,其训练集的平方相关系数(r2)为0.76,12个分子的测试集的平方相关系数为0.68。该模型正确预测了57个分子的测试集中48种化学品的生物降解性,其中生物降解性表现为快速或缓慢。当在神经网络模拟中测试同一组参数作为输入时,使用人工神经网络对两个测试集都能给出更好的预测。对于回归模型和神经网络模型,快速生物降解化学品的预测都比缓慢生物降解化学品的预测更准确。