Comparative studies on some metrics for external validation of QSPR models.

Affiliation

Drug Theoretics and Cheminformatics Laboratory, Division of Medicinal and Pharmaceutical Chemistry, Department of Pharmaceutical Technology, Jadavpur University, Kolkata 700032, India.

Publication information

J Chem Inf Model. 2012 Feb 27;52(2):396-408. doi: 10.1021/ci200520g. Epub 2012 Jan 17.

DOI:10.1021/ci200520g

PMID:22201416

Abstract

Quantitative structure-property relationship (QSPR) models, which predict properties of untested chemicals, can be used to prioritize the synthesis and experimental testing of new compounds. Validation plays a crucial role in judging the reliability of such predictions. The QSPR literature now gives serious attention to external validation, and predictive quality is in most cases judged from the quality of property predictions for a single test set, as reflected in one or more external validation metrics. Here, we show that a single QSPR model may exhibit a variable degree of prediction quality, as reflected in external validation metrics such as Q²(F1), Q²(F2), Q²(F3), CCC, and r²(m) (all of which are differently modified forms of predicted variance, with a theoretical maximum value of 1), depending on the composition and size of the test set. This report therefore questions the appropriateness of the common "classic" practice of performing external validation on a single test set and then drawing a conclusion about a model's predictive quality from one particular validation metric. The present work further demonstrates that, among the external validation metrics considered, r²(m) yields numerical values that differ statistically significantly from those of the others, among which CCC is the most optimistic, that is, the least stringent. Furthermore, at a given acceptance threshold for external validation metrics, r²(m) provides the most stringent criterion of external validation (especially with Δr²(m) at its highest tolerated value of 0.2), which may be adopted in regulatory decision support processes.
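The abstract names five external validation metrics but gives no formulas. The sketch below computes them using the definitions commonly cited in the QSAR/QSPR literature; these definitions, the function names, and the toy data are assumptions for illustration, not taken from the abstract. In particular, r²(m) is computed here as r²·(1 − √|r² − r₀²|), with r₀² taken from a regression through the origin, and Δr²(m) is the absolute difference between the values obtained with the observed and predicted axes interchanged.

```python
import math

def _mean(values):
    return sum(values) / len(values)

def _ss(values, center):
    """Sum of squared deviations of `values` from `center`."""
    return sum((v - center) ** 2 for v in values)

def _press(y_obs, y_pred):
    """Sum of squared prediction errors on the test set."""
    return sum((o - p) ** 2 for o, p in zip(y_obs, y_pred))

def q2_f1(y_test, y_pred, y_train):
    # Test-set deviations are measured from the TRAINING-set mean.
    return 1.0 - _press(y_test, y_pred) / _ss(y_test, _mean(y_train))

def q2_f2(y_test, y_pred):
    # Test-set deviations are measured from the TEST-set mean.
    return 1.0 - _press(y_test, y_pred) / _ss(y_test, _mean(y_test))

def q2_f3(y_test, y_pred, y_train):
    # Mean squared test error scaled by the training-set variance.
    mse_test = _press(y_test, y_pred) / len(y_test)
    var_train = _ss(y_train, _mean(y_train)) / len(y_train)
    return 1.0 - mse_test / var_train

def ccc(y_obs, y_pred):
    """Concordance correlation coefficient."""
    mo, mp = _mean(y_obs), _mean(y_pred)
    cov = sum((o - mo) * (p - mp) for o, p in zip(y_obs, y_pred))
    denom = _ss(y_obs, mo) + _ss(y_pred, mp) + len(y_obs) * (mo - mp) ** 2
    return 2.0 * cov / denom

def _r2m_one_way(y, x):
    """r2m with y on the dependent axis (assumed definition, see lead-in)."""
    my, mx = _mean(y), _mean(x)
    cov = sum((a - my) * (b - mx) for a, b in zip(y, x))
    r2 = cov ** 2 / (_ss(y, my) * _ss(x, mx))
    # Slope of the regression line forced through the origin.
    k = sum(a * b for a, b in zip(y, x)) / sum(b * b for b in x)
    r0_2 = 1.0 - sum((a - k * b) ** 2 for a, b in zip(y, x)) / _ss(y, my)
    return r2 * (1.0 - math.sqrt(abs(r2 - r0_2)))

def r2m_summary(y_obs, y_pred):
    """Return (average r2m, delta r2m) over the two axis assignments."""
    forward = _r2m_one_way(y_obs, y_pred)
    reverse = _r2m_one_way(y_pred, y_obs)
    return (forward + reverse) / 2.0, abs(forward - reverse)

# Toy data (hypothetical): predictions carry a constant offset of +1.
y_train = [0.0, 2.0, 4.0, 6.0]
y_test = [1.0, 2.0, 3.0, 4.0]
y_pred = [2.0, 3.0, 4.0, 5.0]

print("Q2F1:", q2_f1(y_test, y_pred, y_train))
print("Q2F2:", q2_f2(y_test, y_pred))
print("Q2F3:", q2_f3(y_test, y_pred, y_train))
print("CCC: ", ccc(y_test, y_pred))
print("avg r2m, delta r2m:", r2m_summary(y_test, y_pred))
```

On this toy test set, the same predictions receive noticeably different scores depending on the metric and on which mean or variance serves as the reference. Rerunning the functions on different test subsets of a real data set is one way to observe the dependence on test set composition and size that the abstract describes.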

Similar articles

Quantitative structure-property relationship for predicting chlorine demand by organic molecules.

Environ Sci Technol. 2010 Apr 1;44(7):2503-8. doi: 10.1021/es903164d.

Real external predictivity of QSAR models. Part 2. New intercomparable thresholds for different validation criteria and the need for scatter plot inspection.

J Chem Inf Model. 2012 Aug 27;52(8):2044-58. doi: 10.1021/ci300084j. Epub 2012 Jul 13.

Some case studies on application of "r(m)2" metrics for judging quality of quantitative structure-activity relationship predictions: emphasis on scaling of response data.

J Comput Chem. 2013 May 5;34(12):1071-82. doi: 10.1002/jcc.23231. Epub 2013 Jan 8.

Statistical external validation and consensus modeling: a QSPR case study for Koc prediction.

J Mol Graph Model. 2007 Mar;25(6):755-66. doi: 10.1016/j.jmgm.2006.06.005. Epub 2006 Aug 4.

QSPR study of Setschenow constants of organic compounds using MLR, ANN, and SVM analyses.

J Comput Chem. 2011 Nov 30;32(15):3241-52. doi: 10.1002/jcc.21907. Epub 2011 Aug 12.

Real external predictivity of QSAR models: how to evaluate it? Comparison of different validation criteria and proposal of using the concordance correlation coefficient.

J Chem Inf Model. 2011 Sep 26;51(9):2320-35. doi: 10.1021/ci200211n. Epub 2011 Aug 12.

New QSPR equations for prediction of aqueous solubility for military compounds.

Chemosphere. 2010 May;79(8):887-90. doi: 10.1016/j.chemosphere.2010.02.030. Epub 2010 Mar 16.

Prediction of aqueous solubility based on large datasets using several QSPR models utilizing topological structure representation.

Chem Biodivers. 2004 Nov;1(11):1829-41. doi: 10.1002/cbdv.200490137.

Screening of persistent organic pollutants by QSPR classification models: a comparative study.

J Mol Graph Model. 2008 Aug;27(1):59-65. doi: 10.1016/j.jmgm.2008.02.004. Epub 2008 Mar 4.

Cited by

Predicting flavonoid physicochemical properties using topological indices and regression modeling.

Sci Rep. 2025 Jul 29;15(1):27540. doi: 10.1038/s41598-025-11084-w.

Development of hybrid models by the integration of the read-across hypothesis with the QSAR framework for the assessment of developmental and reproductive toxicity (DART) tested according to OECD TG 414.

Toxicol Rep. 2024 Nov 19;13:101822. doi: 10.1016/j.toxrep.2024.101822. eCollection 2024 Dec.

Development of Novel ROCK Inhibitors via 3D-QSAR and Molecular Docking Studies: A Framework for Multi-Target Drug Design.

Pharmaceutics. 2024 Sep 26;16(10):1250. doi: 10.3390/pharmaceutics16101250.

Quantitative Structure-Activity Relationship in the Series of 5-Ethyluridine, N2-Guanine, and 6-Oxopurine Derivatives with Pronounced Anti-Herpetic Activity.

Molecules. 2023 Nov 22;28(23):7715. doi: 10.3390/molecules28237715.

Preclinical Evaluation of an Imidazole-Linked Heterocycle for Alzheimer's Disease.

Pharmaceutics. 2023 Sep 25;15(10):2381. doi: 10.3390/pharmaceutics15102381.

In-silico activity prediction and docking studies of some flavonol derivatives as anti-prostate cancer agents based on Monte Carlo optimization.

BMC Chem. 2023 Jul 26;17(1):87. doi: 10.1186/s13065-023-00999-y.

A QSAR Study for Antileishmanial 2-Phenyl-2,3-dihydrobenzofurans.

Molecules. 2023 Apr 12;28(8):3399. doi: 10.3390/molecules28083399.

3D-QSAR Studies, Molecular Docking, Molecular Dynamic Simulation, and ADMET Proprieties of Novel Pteridinone Derivatives as PLK1 Inhibitors for the Treatment of Prostate Cancer.

Life (Basel). 2023 Jan 2;13(1):127. doi: 10.3390/life13010127.

Exploring the inhibitory mechanisms of indazole compounds against SAH/MTAN-mediated quorum sensing utilizing QSAR and docking.

Drug Target Insights. 2022 Dec 22;16:54-68. doi: 10.33393/dti.2022.2512. eCollection 2022 Jan-Dec.

Development of Ion Character Property Relationship (IC-PR) for Removal of 13-Metal Ions by Employing a Novel Green Adsorbent.

Molecules. 2022 Nov 25;27(23):8213. doi: 10.3390/molecules27238213.
