Cronin Mark T D, Netzeva Tatiana I, Dearden John C, Edwards Robert, Worgan Andrew D P
School of Pharmacy and Chemistry, Liverpool John Moores University, Byrom Street, Liverpool, L3 3AF, England.
Chem Res Toxicol. 2004 Apr;17(4):545-54. doi: 10.1021/tx0342518.
This study reports a database of toxicity values for 91 compounds assessed in a novel, rapid, and economical 15 min algal toxicity test. The toxicity data were measured using the unicellular green alga Chlorella vulgaris in an assay that determined the disappearance of fluorescein diacetate. The chemicals tested covered a wide range of physicochemical properties and mechanisms of action. Quantitative activity-activity relationships with the toxicity of the chemicals to other species (Tetrahymena pyriformis, Vibrio fischeri, and Pimephales promelas) showed strong relationships, although some differences resulting from different protocols were established. Quantitative structure-activity relationships (QSARs) were determined using linear [multiple linear regression (MLR)] and nonlinear [k-nearest neighbors (KNN)] methods. Three descriptors, accounting for hydrophobicity, electrophilicity, and a function of molecular size corrected for the presence of heteroatoms, were found to be important to model toxicity. The predictivity of MLR was compared to KNN using leave-one-out cross-validation and the simulation of an external test set. MLR demonstrated greater stability in validation. The results of this study showed that method selection in QSAR is task-dependent and it is inappropriate to resort to more complicated but less transparent methods, unless there are clear indications (e.g., inability of MLR to deal with the data set) for the need of such methods.
本研究报告了一个毒性值数据库,该数据库涵盖了在一项新颖、快速且经济的15分钟藻类毒性试验中评估的91种化合物。毒性数据是使用单细胞绿藻普通小球藻在一项测定荧光素二乙酸酯消失情况的试验中测得的。所测试的化学物质涵盖了广泛的物理化学性质和作用机制。与这些化学物质对其他物种(梨形四膜虫、费氏弧菌和美凤蝶)的毒性的定量活性-活性关系显示出很强的相关性,尽管由于不同的实验方案而存在一些差异。使用线性[多元线性回归(MLR)]和非线性[k近邻(KNN)]方法确定了定量构效关系(QSAR)。发现三个描述符,分别用于说明疏水性、亲电性以及针对杂原子存在情况校正后的分子大小函数,对毒性建模很重要。使用留一法交叉验证和外部测试集模拟将MLR的预测能力与KNN进行了比较。MLR在验证中表现出更高的稳定性。本研究结果表明,QSAR中的方法选择取决于任务,除非有明确迹象(例如,MLR无法处理数据集)表明需要使用更复杂但透明度较低的方法,否则采用此类方法是不合适的。