• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

QSAR 模型的实际外部预测能力。第 2 部分。不同验证标准的新可比阈值以及需要进行散点图检查。

Real external predictivity of QSAR models. Part 2. New intercomparable thresholds for different validation criteria and the need for scatter plot inspection.

机构信息

QSAR Research Unit in Environmental Chemistry and Ecotoxicology, Department of Theoretical and Applied Sciences, University of Insubria, Via Dunant 3, 21100, Varese, Italy.

出版信息

J Chem Inf Model. 2012 Aug 27;52(8):2044-58. doi: 10.1021/ci300084j. Epub 2012 Jul 13.

DOI:10.1021/ci300084j
PMID:22721530
Abstract

The evaluation of regression QSAR model performance, in fitting, robustness, and external prediction, is of pivotal importance. Over the past decade, different external validation parameters have been proposed: Q(F1)(2), Q(F2)(2), Q(F3)(2), r(m)(2), and the Golbraikh-Tropsha method. Recently, the concordance correlation coefficient (CCC, Lin), which simply verifies how small the differences are between experimental data and external data set predictions, independently of their range, was proposed by our group as an external validation parameter for use in QSAR studies. In our preliminary work, we demonstrated with thousands of simulated models that CCC is in good agreement with the compared validation criteria (except r(m)(2)) using the cutoff values normally applied for the acceptance of QSAR models as externally predictive. In this new work, we have studied and compared the general trends of the various criteria relative to different possible biases (scale and location shifts) in external data distributions, using a wide range of different simulated scenarios. This study, further supported by visual inspection of experimental vs predicted data scatter plots, has highlighted problems related to some criteria. Indeed, if based on the cutoff suggested by the proponent, r(m)(2) could also accept not predictive models in two of the possible biases (location, location plus scale), while in the case of scale shift bias, it appears to be the most restrictive. Moreover, Q(F1)(2) and Q(F2)(2) showed some problems in one of the possible biases (scale shift). This analysis allowed us to also propose recalibrated, and intercomparable for the same data scatter, new thresholds for each criterion in defining a QSAR model as really externally predictive in a more precautionary approach. An analysis of the results revealed that the scatter plot of experimental vs predicted external data must always be evaluated to support the statistical criteria values: in some cases high statistical parameter values could hide models with unacceptable predictions.

摘要

回归 QSAR 模型性能的评估,包括拟合、稳健性和外部预测,至关重要。在过去的十年中,已经提出了不同的外部验证参数:Q(F1)(2)、Q(F2)(2)、Q(F3)(2)、r(m)(2)和 Golbraikh-Tropsha 方法。最近,我们小组提出了一致性相关系数(CCC,Lin)作为一种外部验证参数,用于 QSAR 研究,它简单地验证了实验数据与外部数据集预测之间的差异有多小,而与它们的范围无关。在我们的初步工作中,我们使用数千个模拟模型证明,CCC 与比较验证标准(除 r(m)(2)外)非常一致,使用通常用于接受 QSAR 模型作为外部可预测模型的截止值。在这项新工作中,我们研究并比较了不同可能的外部数据分布偏差(尺度和位置偏移)下各种标准的总体趋势,使用了广泛的不同模拟场景。这项研究进一步通过实验数据与预测数据散点图的直观检查得到支持,突出了与一些标准相关的问题。实际上,如果基于建议者提出的截止值,r(m)(2)也可以接受两种可能的偏差(位置、位置加尺度)中的不可预测模型,而在尺度偏移偏差的情况下,它似乎是最具限制性的。此外,Q(F1)(2)和 Q(F2)(2)在一种可能的偏差(尺度偏移)中显示出一些问题。这种分析还允许我们在更谨慎的方法中为每个标准提出新的、可重新校准的、可比较的阈值,以便将 QSAR 模型定义为真正的外部可预测模型。对结果的分析表明,必须始终评估实验数据与预测外部数据的散点图,以支持统计标准值:在某些情况下,高统计参数值可能隐藏了预测不可接受的模型。

相似文献

1
Real external predictivity of QSAR models. Part 2. New intercomparable thresholds for different validation criteria and the need for scatter plot inspection.QSAR 模型的实际外部预测能力。第 2 部分。不同验证标准的新可比阈值以及需要进行散点图检查。
J Chem Inf Model. 2012 Aug 27;52(8):2044-58. doi: 10.1021/ci300084j. Epub 2012 Jul 13.
2
Real external predictivity of QSAR models: how to evaluate it? Comparison of different validation criteria and proposal of using the concordance correlation coefficient.QSAR 模型的真实外部预测能力:如何评估?不同验证标准的比较及使用一致性相关系数的建议。
J Chem Inf Model. 2011 Sep 26;51(9):2320-35. doi: 10.1021/ci200211n. Epub 2011 Aug 12.
3
Comparative studies on some metrics for external validation of QSPR models.比较研究 QSPR 模型外部验证的一些指标。
J Chem Inf Model. 2012 Feb 27;52(2):396-408. doi: 10.1021/ci200520g. Epub 2012 Jan 17.
4
Predictive QSAR modeling of HIV reverse transcriptase inhibitor TIBO derivatives.HIV逆转录酶抑制剂替博(TIBO)衍生物的预测性定量构效关系建模
Eur J Med Chem. 2009 Apr;44(4):1509-24. doi: 10.1016/j.ejmech.2008.07.020. Epub 2008 Jul 24.
5
Internal and external validation of the long-term QSARs for neutral organics to fish from ECOSAR™.ECOSAR™ 中中性有机物对鱼类长期 QSAR 的内部和外部验证。
SAR QSAR Environ Res. 2011 Jul-Sep;22(5-6):545-59. doi: 10.1080/1062936X.2011.569949. Epub 2011 Jul 7.
6
Combinatorial QSAR modeling of chemical toxicants tested against Tetrahymena pyriformis.针对梨形四膜虫测试的化学毒物的组合定量构效关系建模。
J Chem Inf Model. 2008 Apr;48(4):766-84. doi: 10.1021/ci700443v. Epub 2008 Mar 1.
7
Is regression through origin useful in external validation of QSAR models?通过原点回归在QSAR模型的外部验证中有用吗?
Eur J Pharm Sci. 2014 Aug 1;59:31-5. doi: 10.1016/j.ejps.2014.03.007. Epub 2014 Apr 8.
8
Determination and prediction of xenoestrogens by recombinant yeast-based assay and QSAR.基于重组酵母检测法和定量构效关系对异雌激素的测定与预测
Chemosphere. 2009 Mar;74(9):1152-7. doi: 10.1016/j.chemosphere.2008.11.081. Epub 2009 Jan 10.
9
Prediction of rodent carcinogenic potential of naturally occurring chemicals in the human diet using high-throughput QSAR predictive modeling.使用高通量定量构效关系预测模型预测人类饮食中天然存在的化学物质的啮齿动物致癌潜力。
Toxicol Appl Pharmacol. 2007 Jul 1;222(1):1-16. doi: 10.1016/j.taap.2007.03.012. Epub 2007 Mar 24.
10
External validation and prediction employing the predictive squared correlation coefficient test set activity mean vs training set activity mean.采用预测平方相关系数检验集活性均值与训练集活性均值进行外部验证和预测。
J Chem Inf Model. 2008 Nov;48(11):2140-5. doi: 10.1021/ci800253u.

引用本文的文献

1
Predicting EGFR Inhibitory Effect of Osimertinib Derivatives by Mixed Kernel SVM Enhanced with CLPSO.基于CLPSO增强的混合核支持向量机预测奥希替尼衍生物的表皮生长因子受体抑制作用
Pharmaceuticals (Basel). 2025 Jul 23;18(8):1092. doi: 10.3390/ph18081092.
2
From structure to strategy: chemometric modeling for the prediction of terminal half-life of pharmaceuticals and its role in future therapeutics.从结构到策略:药物终末半衰期预测的化学计量学建模及其在未来治疗中的作用
Mol Divers. 2025 Aug 21. doi: 10.1007/s11030-025-11322-3.
3
prediction of p values using explainable deep learning methods.
使用可解释深度学习方法预测p值。
J Pharm Anal. 2025 Jun;15(6):101174. doi: 10.1016/j.jpha.2024.101174. Epub 2024 Dec 28.
4
Assessment of the rat acute oral toxicity of quinoline-based pharmaceutical scaffold molecules using QSTR, q-RASTR and machine learning methods.使用定量结构-活性关系(QSTR)、定量响应-活性关系(q-RASTR)和机器学习方法评估喹啉类药物支架分子的大鼠急性经口毒性。
Mol Divers. 2025 Jun 27. doi: 10.1007/s11030-025-11265-9.
5
Corrosion inhibition of aluminum alloy in HCl by SDS: experimental, SEM/AFM imaging, and computational insights (DFT and MD simulations).十二烷基硫酸钠对铝合金在盐酸中的缓蚀作用:实验、扫描电子显微镜/原子力显微镜成像及计算分析(密度泛函理论和分子动力学模拟)
J Mol Model. 2025 May 27;31(6):172. doi: 10.1007/s00894-025-06391-y.
6
Unveiling the interspecies correlation and sensitivity factor analysis of rat and mouse acute oral toxicity of antimicrobial agents: first QSTR and QTTR Modeling report.揭示抗菌剂对大鼠和小鼠急性经口毒性的种间相关性和敏感性因子分析:首个定量结构-毒性关系(QSTR)和定量时间-毒性关系(QTTR)建模报告
Toxicol Res (Camb). 2024 Nov 16;13(6):tfae191. doi: 10.1093/toxres/tfae191. eCollection 2024 Dec.
7
Predicting the Time-Dependent Toxicities of Binary Mixtures of Five Antibiotics to sp.- Based on the QSAR Model.基于定量构效关系模型预测五种抗生素二元混合物对sp.的时间依赖性毒性
Environ Health (Wash). 2024 Apr 17;2(7):465-473. doi: 10.1021/envhealth.4c00001. eCollection 2024 Jul 19.
8
Molecular Interactions Governing the Rat Aryl Hydrocarbon Receptor Activities of Polycyclic Aromatic Compounds and Predictive Model Development.多环芳烃类化合物调控大鼠芳香烃受体活性的分子相互作用及预测模型的建立。
Molecules. 2024 Sep 29;29(19):4619. doi: 10.3390/molecules29194619.
9
Applicability Domain for Trustable Predictions.可信赖预测的适用域。
Methods Mol Biol. 2025;2834:131-149. doi: 10.1007/978-1-0716-4003-6_6.
10
The rat acute oral toxicity of trifluoromethyl compounds (TFMs): a computational toxicology study combining the 2D-QSTR, read-across and consensus modeling methods.三氟甲基化合物(TFMs)的大鼠急性经口毒性:结合 2D-QSTR、read-across 和共识建模方法的计算毒理学研究。
Arch Toxicol. 2024 Jul;98(7):2213-2229. doi: 10.1007/s00204-024-03739-w. Epub 2024 Apr 16.