Contrera Joseph F, Kruhlak Naomi L, Matthews Edwin J, Benz R Daniel
US Food and Drug Administration, Center for Drug Evaluation and Research, Office of Pharmaceutical Science, Informatics and Computational Safety Analysis Staff, 10903 New Hampshire Avenue, Silver Spring, MD 20993-0002, USA.
Regul Toxicol Pharmacol. 2007 Dec;49(3):172-82. doi: 10.1016/j.yrtph.2007.07.001. Epub 2007 Jul 17.
This report presents a comparison of the predictive performance of MC4PC and MDL-QSAR software as well as a method for combining the predictions from both programs to increase overall accuracy. The conclusions are based on 10 x 10% leave-many-out internal cross-validation studies using 1540 training set compounds with 2-year rodent carcinogenicity findings. The models were generated using the same weight of evidence scoring method previously developed [Matthews, E.J., Contrera, J.F., 1998. A new highly specific method for predicting the carcinogenic potential of pharmaceuticals in rodents using enhanced MCASE QSAR-ES software. Regul. Toxicol. Pharmacol. 28, 242-264.]. Although MC4PC and MDL-QSAR use different algorithms, their overall predictive performance was remarkably similar. Respectively, the sensitivity of MC4PC and MDL-QSAR was 61 and 63%, specificity was 71 and 75%, and concordance was 66 and 69%. Coverage for both programs was over 95% and receiver operator characteristic (ROC) intercept statistic values were above 2.00. The software programs had complimentary coverage with none of the 1540 compounds being uncovered by both MC4PC and MDL-QSAR. Merging MC4PC and MDL-QSAR predictions improved the overall predictive performance. Consensus sensitivity increased to 67%, specificity to 84%, concordance to 76%, and ROC to 4.31. Consensus rules can be tuned to reflect the priorities of the user, so that greater emphasis may be placed on predictions with high sensitivity/low false negative rates or high specificity/low false positive rates. Sensitivity was optimized to 75% by reclassifying all compounds predicted to be positive in MC4PC or MDL-QSAR as positive, and specificity was optimized to 89% by reclassifying all compounds predicted negative in MC4PC or MDL-QSAR as negative.
本报告介绍了MC4PC和MDL-QSAR软件预测性能的比较,以及一种结合两个程序的预测结果以提高总体准确性的方法。结论基于使用1540种具有2年啮齿动物致癌性研究结果的训练集化合物进行的10×10%多组留出法内部交叉验证研究。模型使用先前开发的相同证据权重评分方法生成[Matthews, E.J., Contrera, J.F., 1998. 一种使用增强型MCASE QSAR-ES软件预测啮齿动物中药物致癌潜力的新型高特异性方法。Regul. Toxicol. Pharmacol. 28, 242-264.]。虽然MC4PC和MDL-QSAR使用不同的算法,但它们的总体预测性能非常相似。MC4PC和MDL-QSAR的灵敏度分别为61%和63%,特异性分别为71%和75%,一致性分别为66%和69%。两个程序的覆盖率均超过95%,受试者工作特征(ROC)截距统计值均高于2.00。这两个软件程序具有互补的覆盖范围,1540种化合物中没有一种被MC4PC和MDL-QSAR都遗漏。合并MC4PC和MDL-QSAR的预测结果可提高总体预测性能。共识灵敏度提高到67%,特异性提高到84%,一致性提高到76%,ROC提高到4.31。共识规则可以进行调整以反映用户的优先级,从而可以更加重视具有高灵敏度/低假阴性率或高特异性/低假阳性率的预测。通过将MC4PC或MDL-QSAR中预测为阳性的所有化合物重新分类为阳性,灵敏度优化到75%,通过将MC4PC或MDL-QSAR中预测为阴性的所有化合物重新分类为阴性,特异性优化到89%。