• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

谨防R(2):对定量构效关系和定量构性关系模型预测准确性的简单、明确评估。

Beware of R(2): Simple, Unambiguous Assessment of the Prediction Accuracy of QSAR and QSPR Models.

作者信息

Alexander D L J, Tropsha A, Winkler David A

机构信息

†CSIRO Digital Productivity Flagship, Private Bag 10, Clayton South, VIC 3169, Australia.

‡UNC Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States.

出版信息

J Chem Inf Model. 2015 Jul 27;55(7):1316-22. doi: 10.1021/acs.jcim.5b00206. Epub 2015 Jul 9.

DOI:10.1021/acs.jcim.5b00206
PMID:26099013
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4530125/
Abstract

The statistical metrics used to characterize the external predictivity of a model, i.e., how well it predicts the properties of an independent test set, have proliferated over the past decade. This paper clarifies some apparent confusion over the use of the coefficient of determination, R(2), as a measure of model fit and predictive power in QSAR and QSPR modeling. R(2) (or r(2)) has been used in various contexts in the literature in conjunction with training and test data for both ordinary linear regression and regression through the origin as well as with linear and nonlinear regression models. We analyze the widely adopted model fit criteria suggested by Golbraikh and Tropsha ( J. Mol. Graphics Modell. 2002 , 20 , 269 - 276 ) in a strict statistical manner. Shortcomings in these criteria are identified, and a clearer and simpler alternative method to characterize model predictivity is provided. The intent is not to repeat the well-documented arguments for model validation using test data but rather to guide the application of R(2) as a model fit statistic. Examples are used to illustrate both correct and incorrect uses of R(2). Reporting the root-mean-square error or equivalent measures of dispersion, which are typically of more practical importance than R(2), is also encouraged, and important challenges in addressing the needs of different categories of users such as computational chemists, experimental scientists, and regulatory decision support specialists are outlined.

摘要

在过去十年中,用于表征模型外部预测能力(即其预测独立测试集性质的能力)的统计指标大量涌现。本文澄清了在定量构效关系(QSAR)和定量构性关系(QSPR)建模中,将决定系数R²用作模型拟合度和预测能力度量时出现的一些明显混淆。R²(或r²)在文献中的各种语境下,与普通线性回归及过原点回归的训练数据和测试数据一起使用,也与线性和非线性回归模型一起使用。我们以严格的统计方式分析了Golbraikh和Tropsha(《分子图形与建模杂志》,2002年,20卷,269 - 276页)提出的被广泛采用的模型拟合标准。确定了这些标准中的缺点,并提供了一种更清晰、更简单的表征模型预测能力的替代方法。目的不是重复使用测试数据进行模型验证的那些有充分记录的论据,而是指导将R²用作模型拟合统计量的应用。通过示例来说明R²的正确和错误用法。还鼓励报告均方根误差或等效的离散度度量,它们通常比R²更具实际重要性,并概述了在满足计算化学家、实验科学家和监管决策支持专家等不同类别用户需求方面的重大挑战。

相似文献

1
Beware of R(2): Simple, Unambiguous Assessment of the Prediction Accuracy of QSAR and QSPR Models.谨防R(2):对定量构效关系和定量构性关系模型预测准确性的简单、明确评估。
J Chem Inf Model. 2015 Jul 27;55(7):1316-22. doi: 10.1021/acs.jcim.5b00206. Epub 2015 Jul 9.
2
Real external predictivity of QSAR models: how to evaluate it? Comparison of different validation criteria and proposal of using the concordance correlation coefficient.QSAR 模型的真实外部预测能力:如何评估?不同验证标准的比较及使用一致性相关系数的建议。
J Chem Inf Model. 2011 Sep 26;51(9):2320-35. doi: 10.1021/ci200211n. Epub 2011 Aug 12.
3
Rational selection of training and test sets for the development of validated QSAR models.为开发经过验证的定量构效关系(QSAR)模型合理选择训练集和测试集。
J Comput Aided Mol Des. 2003 Feb-Apr;17(2-4):241-53. doi: 10.1023/a:1025386326946.
4
Combinatorial QSAR of ambergris fragrance compounds.龙涎香香料化合物的组合定量构效关系
J Chem Inf Comput Sci. 2004 Mar-Apr;44(2):582-95. doi: 10.1021/ci034203t.
5
Real external predictivity of QSAR models. Part 2. New intercomparable thresholds for different validation criteria and the need for scatter plot inspection.QSAR 模型的实际外部预测能力。第 2 部分。不同验证标准的新可比阈值以及需要进行散点图检查。
J Chem Inf Model. 2012 Aug 27;52(8):2044-58. doi: 10.1021/ci300084j. Epub 2012 Jul 13.
6
The rm2 metrics and regression through origin approach: reliable and useful validation tools for predictive QSAR models (Commentary on 'Is regression through origin useful in external validation of QSAR models?').rm2指标与过原点回归方法:预测性QSAR模型的可靠且有用的验证工具(关于“过原点回归在QSAR模型外部验证中有用吗?”的评论)
Eur J Pharm Sci. 2014 Oct 1;62:111-4. doi: 10.1016/j.ejps.2014.05.019. Epub 2014 May 29.
7
Combinatorial QSAR modeling of P-glycoprotein substrates.P-糖蛋白底物的组合定量构效关系建模
J Chem Inf Model. 2006 May-Jun;46(3):1245-54. doi: 10.1021/ci0504317.
8
Is regression through origin useful in external validation of QSAR models?通过原点回归在QSAR模型的外部验证中有用吗?
Eur J Pharm Sci. 2014 Aug 1;59:31-5. doi: 10.1016/j.ejps.2014.03.007. Epub 2014 Apr 8.
9
Predictive QSAR modeling of HIV reverse transcriptase inhibitor TIBO derivatives.HIV逆转录酶抑制剂替博(TIBO)衍生物的预测性定量构效关系建模
Eur J Med Chem. 2009 Apr;44(4):1509-24. doi: 10.1016/j.ejmech.2008.07.020. Epub 2008 Jul 24.
10
Comparative studies on some metrics for external validation of QSPR models.比较研究 QSPR 模型外部验证的一些指标。
J Chem Inf Model. 2012 Feb 27;52(2):396-408. doi: 10.1021/ci200520g. Epub 2012 Jan 17.

引用本文的文献

1
Global trends in shoulder dislocation research: A bibliometric review with hotspot visualization.肩关节脱位研究的全球趋势:一项文献计量学综述与热点可视化分析
Medicine (Baltimore). 2025 Aug 22;104(34):e43652. doi: 10.1097/MD.0000000000043652.
2
QSAR Models for Predicting the Antioxidant Potential of Chemical Substances.用于预测化学物质抗氧化潜力的定量构效关系模型
J Xenobiot. 2025 May 25;15(3):80. doi: 10.3390/jox15030080.
3
Mediational effect analysis of childhood emotional abuse on prodromal psychotic symptoms in self-taught examination students.

本文引用的文献

1
Best Practices for QSAR Model Development, Validation, and Exploitation.定量构效关系(QSAR)模型开发、验证及应用的最佳实践
Mol Inform. 2010 Jul 12;29(6-7):476-88. doi: 10.1002/minf.201000061. Epub 2010 Jul 6.
2
The rm2 metrics and regression through origin approach: reliable and useful validation tools for predictive QSAR models (Commentary on 'Is regression through origin useful in external validation of QSAR models?').rm2指标与过原点回归方法:预测性QSAR模型的可靠且有用的验证工具(关于“过原点回归在QSAR模型外部验证中有用吗?”的评论)
Eur J Pharm Sci. 2014 Oct 1;62:111-4. doi: 10.1016/j.ejps.2014.05.019. Epub 2014 May 29.
3
自学考试学生童年期情感虐待对前驱精神病性症状的中介效应分析
Sci Rep. 2025 Jun 6;15(1):19932. doi: 10.1038/s41598-025-05062-5.
4
Selection for Early Reproduction Leads to Accelerated Aging and Extensive Metabolic Remodeling in Drosophila melanogaster.早期繁殖选择导致黑腹果蝇加速衰老和广泛的代谢重塑。
Genome Biol Evol. 2025 Apr 30;17(5). doi: 10.1093/gbe/evaf082.
5
Machine Learning for Toxicity Prediction Using Chemical Structures: Pillars for Success in the Real World.利用化学结构进行毒性预测的机器学习:在现实世界中取得成功的支柱。
Chem Res Toxicol. 2025 May 19;38(5):759-807. doi: 10.1021/acs.chemrestox.5c00033. Epub 2025 May 2.
6
Design, synthesis, and in vitro evaluation of a carbamazepine derivative with antitumor potential in a model of Acute Lymphoblastic Leukemia.一种在急性淋巴细胞白血病模型中具有抗肿瘤潜力的卡马西平衍生物的设计、合成及体外评估。
PLoS One. 2025 Apr 28;20(4):e0319415. doi: 10.1371/journal.pone.0319415. eCollection 2025.
7
A quantum mechanics-based framework for infectious disease modeling.一种基于量子力学的传染病建模框架。
Sci Rep. 2025 Apr 12;15(1):12602. doi: 10.1038/s41598-025-96817-7.
8
The Central Composite Design and Artificial Neural Network Coupled with Genetic Algorithm in Optimization and Modeling of the Radiolabeling Process of Lu-hydroxyapatite as a Potential Radiosynovectomy Agent.中心复合设计与人工神经网络结合遗传算法在优化和建模潜在放射性滑膜切除剂Lu-羟基磷灰石放射性标记过程中的应用
Curr Radiopharm. 2025;18(3):201-215. doi: 10.2174/0118744710336283250227020659.
9
Temporal autocorrelation is predictive of age-An extensive MEG time-series analysis.时间自相关可预测年龄——一项广泛的脑磁图时间序列分析。
Proc Natl Acad Sci U S A. 2025 Feb 25;122(8):e2411098122. doi: 10.1073/pnas.2411098122. Epub 2025 Feb 20.
10
Evolving Dynamics of Colorectal Cancer in High Socio-Demographic Regions.高社会人口统计学区域结直肠癌的动态演变
Cancer Control. 2025 Jan-Dec;32:10732748251321672. doi: 10.1177/10732748251321672.
Is regression through origin useful in external validation of QSAR models?
通过原点回归在QSAR模型的外部验证中有用吗?
Eur J Pharm Sci. 2014 Aug 1;59:31-5. doi: 10.1016/j.ejps.2014.03.007. Epub 2014 Apr 8.
4
Some case studies on application of "r(m)2" metrics for judging quality of quantitative structure-activity relationship predictions: emphasis on scaling of response data.一些关于“r(m)2”指标在判断定量构效关系预测质量中的应用的案例研究:重点是响应数据的定标。
J Comput Chem. 2013 May 5;34(12):1071-82. doi: 10.1002/jcc.23231. Epub 2013 Jan 8.
5
Quantitative structure-property relationship modeling of diverse materials properties.多种材料属性的定量结构-属性关系建模
Chem Rev. 2012 May 9;112(5):2889-919. doi: 10.1021/cr200066h. Epub 2012 Jan 17.
6
Bayesian regularization of neural networks.神经网络的贝叶斯正则化
Methods Mol Biol. 2008;458:25-44. doi: 10.1007/978-1-60327-101-1_3.
7
Trends and plot methods in MLR studies.
J Chem Inf Model. 2007 May-Jun;47(3):751-60. doi: 10.1021/ci6004959. Epub 2007 Apr 25.
8
Exhaustive QSPR studies of a large diverse set of ionic liquids: how accurately can we predict melting points?对大量不同种类离子液体进行的详尽定量构效关系研究:我们能多准确地预测熔点?
J Chem Inf Model. 2007 May-Jun;47(3):1111-22. doi: 10.1021/ci600493x. Epub 2007 Mar 24.
9
Beware of q2!小心q2!
J Mol Graph Model. 2002 Jan;20(4):269-76. doi: 10.1016/s1093-3263(01)00123-1.
10
Are free energy calculations useful in practice? A comparison with rapid scoring functions for the p38 MAP kinase protein system.
J Med Chem. 2001 Oct 11;44(21):3417-23. doi: 10.1021/jm0100279.