Zhu Hao, Tropsha Alexander, Fourches Denis, Varnek Alexandre, Papa Ester, Gramatica Paola, Oberg Tomas, Dao Phuong, Cherkasov Artem, Tetko Igor V
Laboratory for Molecular Modeling, Division of Medicinal Chemistry and Natural Products, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.
J Chem Inf Model. 2008 Apr;48(4):766-84. doi: 10.1021/ci700443v. Epub 2008 Mar 1.
Selecting most rigorous quantitative structure-activity relationship (QSAR) approaches is of great importance in the development of robust and predictive models of chemical toxicity. To address this issue in a systematic way, we have formed an international virtual collaboratory consisting of six independent groups with shared interests in computational chemical toxicology. We have compiled an aqueous toxicity data set containing 983 unique compounds tested in the same laboratory over a decade against Tetrahymena pyriformis. A modeling set including 644 compounds was selected randomly from the original set and distributed to all groups that used their own QSAR tools for model development. The remaining 339 compounds in the original set (external set I) as well as 110 additional compounds (external set II) published recently by the same laboratory (after this computational study was already in progress) were used as two independent validation sets to assess the external predictive power of individual models. In total, our virtual collaboratory has developed 15 different types of QSAR models of aquatic toxicity for the training set. The internal prediction accuracy for the modeling set ranged from 0.76 to 0.93 as measured by the leave-one-out cross-validation correlation coefficient ( Q abs2). The prediction accuracy for the external validation sets I and II ranged from 0.71 to 0.85 (linear regression coefficient R absI2) and from 0.38 to 0.83 (linear regression coefficient R absII2), respectively. The use of an applicability domain threshold implemented in most models generally improved the external prediction accuracy but at the same time led to a decrease in chemical space coverage. Finally, several consensus models were developed by averaging the predicted aquatic toxicity for every compound using all 15 models, with or without taking into account their respective applicability domains. We find that consensus models afford higher prediction accuracy for the external validation data sets with the highest space coverage as compared to individual constituent models. Our studies prove the power of a collaborative and consensual approach to QSAR model development. The best validated models of aquatic toxicity developed by our collaboratory (both individual and consensus) can be used as reliable computational predictors of aquatic toxicity and are available from any of the participating laboratories.
选择最严格的定量构效关系(QSAR)方法对于开发可靠且具有预测性的化学毒性模型至关重要。为了系统地解决这一问题,我们组建了一个国际虚拟合作团队,由六个对计算化学毒理学有共同兴趣的独立小组组成。我们汇编了一个水相毒性数据集,其中包含在十年间于同一实验室针对梨形四膜虫测试的983种独特化合物。从原始数据集中随机选择了一个包含644种化合物的建模集,并分发给所有使用各自QSAR工具进行模型开发的小组。原始数据集中剩余的339种化合物(外部集I)以及同一实验室最近发表的另外110种化合物(外部集II)(在本计算研究已经进行之后)被用作两个独立的验证集,以评估各个模型的外部预测能力。我们的虚拟合作团队总共为训练集开发了15种不同类型的水生毒性QSAR模型。通过留一法交叉验证相关系数(Q abs2)衡量,建模集的内部预测准确率在0.76至0.93之间。外部验证集I和II的预测准确率分别在0.71至0.85(线性回归系数R absI2)和0.38至0.83(线性回归系数R absII2)之间。大多数模型中实施的适用域阈值的使用通常提高了外部预测准确率,但同时导致化学空间覆盖率下降。最后,通过使用所有15个模型对每种化合物的预测水生毒性进行平均,开发了几个共识模型,无论是否考虑其各自的适用域。我们发现,与各个组成模型相比,共识模型为外部验证数据集提供了更高的预测准确率,且具有最高的空间覆盖率。我们的研究证明了合作和共识方法在QSAR模型开发中的力量。我们合作团队开发的经过最佳验证的水生毒性模型(包括个体模型和共识模型)可作为可靠的水生毒性计算预测工具,可从任何一个参与实验室获取。