• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

定量构效关系(QSAR)回归中与适用域相关的预测不确定性

Applicability Domain Dependent Predictive Uncertainty in QSAR Regressions.

作者信息

Sahlin U, Jeliazkova N, Öberg T

机构信息

Centre of Environmental and Climate Research, Lund University, Lund, Sweden phonex:+46 46 222 6831.

Department of Biology and Environmental Science, Linnaeus University, Kalmar, Sweden.

出版信息

Mol Inform. 2014 Jan;33(1):26-35. doi: 10.1002/minf.201200131. Epub 2013 Oct 7.

DOI:10.1002/minf.201200131
PMID:27485196
Abstract

Predictive models used in decision making, such as QSARs in chemical regulation or drug discovery, call for evaluated approaches to quantitatively assess associated uncertainty in predictions. Uncertainty in less reliable predictions may be captured by locally varying predictive errors. In the current study, model-based bootstrapping was combined with analogy reasoning to generate predictive distributions varying in magnitude over a model's domain of applicability. A resampling experiment based on PLS regressions on four QSAR data sets demonstrated that predictive errors assessed by k nearest neighbour or weighted PRedicted Error Sum of Squares (PRESS) on samples of external test data or by internal cross-validation improved the performance of the uncertainty assessment. Analogy using similarity defined by Euclidean distances, or differences in standard deviation in perturbed predictions, resulted in better performances than similarity defined by distance to, or density of, the training data. Locally assessed predictive distributions had on average at least as good coverage as Gaussian distribution with variance assessed from the PRESS. An R-code is provided that evaluates performances of the suggested algorithms to assess predictive error based on log likelihood scores and empirical coverage graphs, and which applies these to derive confidence intervals or samples from the predictive distributions of query compounds.

摘要

用于决策的预测模型,如化学监管或药物研发中的定量构效关系(QSARs),需要经过评估的方法来定量评估预测中相关的不确定性。可靠性较低的预测中的不确定性可以通过局部变化的预测误差来捕捉。在当前研究中,基于模型的自助法与类比推理相结合,以生成在模型适用范围内大小不同的预测分布。基于四个QSAR数据集的偏最小二乘回归的重采样实验表明,通过外部测试数据样本上的k近邻或加权预测误差平方和(PRESS)或通过内部交叉验证评估的预测误差提高了不确定性评估的性能。使用欧几里得距离定义的相似性或扰动预测中的标准差差异进行类比,其性能优于通过与训练数据的距离或密度定义的相似性。局部评估的预测分布平均至少具有与基于PRESS评估方差的高斯分布一样好的覆盖率。提供了一个R代码,用于评估基于对数似然分数和经验覆盖率图评估预测误差的建议算法的性能,并将其应用于从查询化合物的预测分布中推导置信区间或样本。

相似文献

1
Applicability Domain Dependent Predictive Uncertainty in QSAR Regressions.定量构效关系(QSAR)回归中与适用域相关的预测不确定性
Mol Inform. 2014 Jan;33(1):26-35. doi: 10.1002/minf.201200131. Epub 2013 Oct 7.
2
A Kernel-Based Method for Assessing Uncertainty on Individual QSAR Predictions.一种基于核方法的个体定量构效关系预测不确定性评估方法。
Mol Inform. 2012 Oct;31(10):741-51. doi: 10.1002/minf.201200053. Epub 2012 Sep 25.
3
A Risk Assessment Perspective of Current Practice in Characterizing Uncertainties in QSAR Regression Predictions.当前 QSAR 回归预测中不确定性特征化实践的风险评估视角。
Mol Inform. 2011 Jun;30(6-7):551-64. doi: 10.1002/minf.201000177. Epub 2011 May 5.
4
General Approach to Estimate Error Bars for Quantitative Structure-Activity Relationship Predictions of Molecular Activity.定量构效关系预测分子活性的误差估计的一般方法。
J Chem Inf Model. 2018 Aug 27;58(8):1561-1575. doi: 10.1021/acs.jcim.8b00114. Epub 2018 Jul 17.
5
DPRESS: Localizing estimates of predictive uncertainty.DPRESS:本地化预测不确定性的估计。
J Cheminform. 2009 Jul 14;1(1):11. doi: 10.1186/1758-2946-1-11.
6
Application of predictive QSAR models to database mining: identification and experimental validation of novel anticonvulsant compounds.预测性定量构效关系模型在数据库挖掘中的应用:新型抗惊厥化合物的鉴定与实验验证
J Med Chem. 2004 Apr 22;47(9):2356-64. doi: 10.1021/jm030584q.
7
Critical assessment of QSAR models of environmental toxicity against Tetrahymena pyriformis: focusing on applicability domain and overfitting by variable selection.针对梨形四膜虫的环境毒性定量构效关系(QSAR)模型的批判性评估:聚焦适用域及变量选择导致的过拟合问题
J Chem Inf Model. 2008 Sep;48(9):1733-46. doi: 10.1021/ci800151m. Epub 2008 Aug 26.
8
Uncertainty in QSAR predictions.QSAR 预测中的不确定性。
Altern Lab Anim. 2013 Mar;41(1):111-25. doi: 10.1177/026119291304100111.
9
Assessing the reliability of a QSAR model's predictions.评估定量构效关系(QSAR)模型预测的可靠性。
J Mol Graph Model. 2005 Jun;23(6):503-23. doi: 10.1016/j.jmgm.2005.03.003.
10
Defining a novel k-nearest neighbours approach to assess the applicability domain of a QSAR model for reliable predictions.定义一种新的 k-最近邻方法来评估 QSAR 模型的适用域,以进行可靠的预测。
J Cheminform. 2013 May 30;5(1):27. doi: 10.1186/1758-2946-5-27.

引用本文的文献

1
A practical guide to machine-learning scoring for structure-based virtual screening.基于结构的虚拟筛选的机器学习评分实用指南。
Nat Protoc. 2023 Nov;18(11):3460-3511. doi: 10.1038/s41596-023-00885-w. Epub 2023 Oct 16.
2
Uncertainty assessment of proarrhythmia predictions derived from multi-level in silico models.多水平计算模型预测致心律失常性的不确定性评估。
Arch Toxicol. 2023 Oct;97(10):2721-2740. doi: 10.1007/s00204-023-03557-6. Epub 2023 Aug 1.
3
How Precise Are Our Quantitative Structure-Activity Relationship Derived Predictions for New Query Chemicals?
我们基于定量构效关系得出的针对新查询化学品的预测有多精确?
ACS Omega. 2018 Sep 19;3(9):11392-11406. doi: 10.1021/acsomega.8b01647. eCollection 2018 Sep 30.
4
Geometry optimization method versus predictive ability in QSPR modeling for ionic liquids.离子液体定量构效关系建模中的几何优化方法与预测能力
J Comput Aided Mol Des. 2016 Feb;30(2):165-76. doi: 10.1007/s10822-016-9894-3. Epub 2016 Feb 1.
5
Assessment of uncertainty in chemical models by Bayesian probabilities: Why, when, how?用贝叶斯概率评估化学模型中的不确定性:为何、何时以及如何?
J Comput Aided Mol Des. 2015 Jul;29(7):583-94. doi: 10.1007/s10822-014-9822-3. Epub 2014 Dec 10.
6
How accurately can we predict the melting points of drug-like compounds?我们能多准确地预测类药性化合物的熔点?
J Chem Inf Model. 2014 Dec 22;54(12):3320-9. doi: 10.1021/ci5005288. Epub 2014 Dec 9.
7
Using beta binomials to estimate classification uncertainty for ensemble models.使用贝塔二项式估计集成模型的分类不确定性。
J Cheminform. 2014 Jun 22;6:34. doi: 10.1186/1758-2946-6-34. eCollection 2014.