Drug Theoretics and Cheminformatics Laboratory, Department of Pharmaceutical Technology, Jadavpur University, Raja S C Mullick Road, Kolkata 700032, India.
J Hazard Mater. 2010 May 15;177(1-3):344-51. doi: 10.1016/j.jhazmat.2009.12.038. Epub 2009 Dec 31.
One of the major economic alternatives to experimental toxicity testing is the use of quantitative structure-activity relationships (QSARs) which are used in formulating regulatory decisions of environmental protection agencies. In this background, we have modeled a large diverse group of 297 chemicals for their toxicity to Daphnia magna using mechanistically interpretable descriptors. Three-dimensional (3D) (electronic and spatial) and two-dimensional (2D) (topological and information content indices) descriptors along with physicochemical parameter logK(o/w) (n-octanol/water partition coefficient) and structural descriptors were used as predictor variables. The QSAR models were developed by stepwise multiple linear regression (MLR), partial least squares (PLS), genetic function approximation (GFA), and genetic PLS (G/PLS). All the models were validated internally and externally. Among several models developed using different chemometric tools, the best model based on both internal and external validation characteristics was a PLS equation with 7 descriptors and three latent variables explaining 67.8% leave-one-out predicted variance and 74.1% external predicted variance. The PLS model suggests that higher lipophilicity and electrophilicity, less negative charge surface area and presence of ether linkage, hydrogen bond donor groups and acetylenic carbons are responsible for greater toxicity of chemicals. The developed model may be used for prediction of toxicity, safety and risk assessment of chemicals to achieve better ecotoxicological management and prevent adverse health consequences.
实验毒性测试的一种主要经济替代方法是使用定量构效关系(QSAR),这在制定环境保护机构的监管决策中被广泛应用。在这种背景下,我们使用可解释机制的描述符对 297 种化学物质对大型溞的毒性进行了建模。三维(3D)(电子和空间)和二维(2D)(拓扑和信息内容指数)描述符以及物理化学参数 logK(o/w)(正辛醇/水分配系数)和结构描述符被用作预测变量。QSAR 模型是通过逐步多元线性回归(MLR)、偏最小二乘(PLS)、遗传函数逼近(GFA)和遗传 PLS(G/PLS)来开发的。所有模型都进行了内部和外部验证。在使用不同化学计量工具开发的多个模型中,基于内部和外部验证特性的最佳模型是一个具有 7 个描述符和三个潜在变量的 PLS 方程,解释了 67.8%的留一外预测方差和 74.1%的外部预测方差。PLS 模型表明,较高的亲脂性和亲电性、较少的负电荷表面积以及醚键、氢键供体基团和炔碳的存在是导致化学物质毒性增加的原因。开发的模型可用于预测化学物质的毒性、安全性和风险评估,以实现更好的生态毒理学管理并预防不良的健康后果。