• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

预测模型的置信度分数。

Confidence scores for prediction models.

作者信息

Gerds Thomas A, van de Wiel Mark A

出版信息

Biom J. 2011 Mar;53(2):259-74. doi: 10.1002/bimj.201000157. Epub 2011 Feb 17.

DOI:10.1002/bimj.201000157
PMID:21328604
Abstract

In medical statistics, many alternative strategies are available for building a prediction model based on training data. Prediction models are routinely compared by means of their prediction performance in independent validation data. If only one data set is available for training and validation, then rival strategies can still be compared based on repeated bootstraps of the same data. Often, however, the overall performance of rival strategies is similar and it is thus difficult to decide for one model. Here, we investigate the variability of the prediction models that results when the same modelling strategy is applied to different training sets. For each modelling strategy we estimate a confidence score based on the same repeated bootstraps. A new decomposition of the expected Brier score is obtained, as well as the estimates of population average confidence scores. The latter can be used to distinguish rival prediction models with similar prediction performances. Furthermore, on the subject level a confidence score may provide useful supplementary information for new patients who want to base a medical decision on predicted risk. The ideas are illustrated and discussed using data from cancer studies, also with high-dimensional predictor space.

摘要

在医学统计学中,有许多可供选择的策略可用于基于训练数据构建预测模型。预测模型通常通过其在独立验证数据中的预测性能进行比较。如果只有一个数据集可用于训练和验证,那么仍然可以基于对同一数据的重复自助法来比较竞争策略。然而,通常情况下,竞争策略的整体性能相似,因此很难确定采用哪种模型。在此,我们研究了将相同建模策略应用于不同训练集时预测模型的变异性。对于每种建模策略,我们基于相同的重复自助法估计一个置信分数。得到了预期布里尔分数的一种新分解,以及总体平均置信分数的估计值。后者可用于区分具有相似预测性能的竞争预测模型。此外,在个体层面,置信分数可能为那些希望基于预测风险做出医疗决策的新患者提供有用的补充信息。我们使用癌症研究的数据(包括高维预测变量空间的数据)对这些想法进行了说明和讨论。

相似文献

1
Confidence scores for prediction models.预测模型的置信度分数。
Biom J. 2011 Mar;53(2):259-74. doi: 10.1002/bimj.201000157. Epub 2011 Feb 17.
2
Mixture classification model based on clinical markers for breast cancer prognosis.基于临床标志物的乳腺癌预后混合分类模型。
Artif Intell Med. 2010 Feb-Mar;48(2-3):129-37. doi: 10.1016/j.artmed.2009.07.008. Epub 2009 Dec 14.
3
[Meta-analysis of the Italian studies on short-term effects of air pollution].[意大利关于空气污染短期影响研究的荟萃分析]
Epidemiol Prev. 2001 Mar-Apr;25(2 Suppl):1-71.
4
Alternative methods to evaluate trial level surrogacy.评估试验水平替代指标的替代方法。
Clin Trials. 2008;5(3):194-208. doi: 10.1177/1740774508091677.
5
Estimation of prediction error for survival models.生存模型预测误差估计。
Stat Med. 2010 Jan 30;29(2):262-74. doi: 10.1002/sim.3758.
6
Ensemble methods for classification of patients for personalized medicine with high-dimensional data.用于基于高维数据的个性化医疗中患者分类的集成方法。
Artif Intell Med. 2007 Nov;41(3):197-207. doi: 10.1016/j.artmed.2007.07.003. Epub 2007 Aug 23.
7
A multi-class predictor based on a probabilistic model: application to gene expression profiling-based diagnosis of thyroid tumors.基于概率模型的多分类预测器:应用于基于基因表达谱的甲状腺肿瘤诊断
BMC Genomics. 2006 Jul 27;7:190. doi: 10.1186/1471-2164-7-190.
8
Error bounds for data-driven models of dynamical systems.动力系统数据驱动模型的误差界
Comput Biol Med. 2007 May;37(5):670-9. doi: 10.1016/j.compbiomed.2006.06.005. Epub 2006 Aug 8.
9
Regression equations in clinical neuropsychology: an evaluation of statistical methods for comparing predicted and obtained scores.临床神经心理学中的回归方程:比较预测分数与实际获得分数的统计方法评估
J Clin Exp Neuropsychol. 1998 Oct;20(5):755-62. doi: 10.1076/jcen.20.5.755.1132.
10
Cancer risk prediction models: a workshop on development, evaluation, and application.癌症风险预测模型:关于开发、评估及应用的研讨会
J Natl Cancer Inst. 2005 May 18;97(10):715-23. doi: 10.1093/jnci/dji128.

引用本文的文献

1
Information Extraction From Electronic Health Records to Predict Readmission Following Acute Myocardial Infarction: Does Natural Language Processing Using Clinical Notes Improve Prediction of Readmission?从电子健康记录中提取信息以预测急性心肌梗死患者的再入院情况:使用临床记录进行自然语言处理是否能提高再入院预测的准确性?
J Am Heart Assoc. 2022 Apr 5;11(7):e024198. doi: 10.1161/JAHA.121.024198. Epub 2022 Mar 24.
2
Development of Electronic Health Record-Based Prediction Models for 30-Day Readmission Risk Among Patients Hospitalized for Acute Myocardial Infarction.基于电子健康记录的急性心肌梗死住院患者 30 天再入院风险预测模型的建立。
JAMA Netw Open. 2021 Jan 4;4(1):e2035782. doi: 10.1001/jamanetworkopen.2020.35782.
3
Evaluating Random Forests for Survival Analysis using Prediction Error Curves.
使用预测误差曲线评估随机森林用于生存分析
J Stat Softw. 2012 Sep;50(11):1-23. doi: 10.18637/jss.v050.i11.
4
stepwiseCM: An R Package for Stepwise Classification of Cancer Samples Using Multiple Heterogeneous Data Sets.
Cancer Inform. 2014 Jan 2;13:1-11. doi: 10.4137/CIN.S13075. eCollection 2014.