• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用电子健康记录数据评估风险预测模型

EVALUATING RISK-PREDICTION MODELS USING DATA FROM ELECTRONIC HEALTH RECORDS.

作者信息

Wang L E, Shaw Pamela A, Mathelier Hansie M, Kimmel Stephen E, French Benjamin

机构信息

DEPARTMENT OF BIOSTATISTICS AND EPIDEMIOLOGY, UNIVERSITY OF PENNSYLVANIA, 423 GUARDIAN DRIVE, PHILADELPHIA, PENNSYLVANIA 19104, USA.

DEPARTMENT OF MEDICINE, UNIVERSITY OF PENNSYLVANIA, 51 N 39TH STREET, PHILADELPHIA, PENNSYLVANIA 19104, USA.

出版信息

Ann Appl Stat. 2016 Mar;10(1):286-304. doi: 10.1214/15-AOAS891.

DOI:10.1214/15-AOAS891
PMID:27158296
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4859766/
Abstract

The availability of data from electronic health records facilitates the development and evaluation of risk-prediction models, but estimation of prediction accuracy could be limited by outcome misclassification, which can arise if events are not captured. We evaluate the robustness of prediction accuracy summaries, obtained from receiver operating characteristic curves and risk-reclassification methods, if events are not captured (i.e., "false negatives"). We derive estimators for sensitivity and specificity if misclassification is independent of marker values. In simulation studies, we quantify the potential for bias in prediction accuracy summaries if misclassification depends on marker values. We compare the accuracy of alternative prognostic models for 30-day all-cause hospital readmission among 4548 patients discharged from the University of Pennsylvania Health System with a primary diagnosis of heart failure. Simulation studies indicate that if misclassification depends on marker values, then the estimated accuracy improvement is also biased, but the direction of the bias depends on the direction of the association between markers and the probability of misclassification. In our application, 29% of the 1143 readmitted patients were readmitted to a hospital elsewhere in Pennsylvania, which reduced prediction accuracy. Outcome misclassification can result in erroneous conclusions regarding the accuracy of risk-prediction models.

摘要

电子健康记录中的数据可用性促进了风险预测模型的开发和评估,但预测准确性的估计可能会受到结局误分类的限制,如果事件未被捕获,就可能出现这种情况。我们评估了从接受者操作特征曲线和风险重新分类方法获得的预测准确性总结在事件未被捕获(即“假阴性”)时的稳健性。如果误分类与标志物值无关,我们推导出敏感性和特异性的估计量。在模拟研究中,如果误分类取决于标志物值,我们量化预测准确性总结中偏差的可能性。我们比较了宾夕法尼亚大学医疗系统出院的4548例原发性心力衰竭患者30天全因再入院的替代预后模型的准确性。模拟研究表明,如果误分类取决于标志物值,那么估计的准确性改善也会有偏差,但偏差的方向取决于标志物与误分类概率之间关联的方向。在我们的应用中,1143例再入院患者中有29%在宾夕法尼亚州的其他医院再入院,这降低了预测准确性。结局误分类可能导致关于风险预测模型准确性的错误结论。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/034d/4859766/4ce86fd1e16e/nihms-780125-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/034d/4859766/4ce86fd1e16e/nihms-780125-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/034d/4859766/4ce86fd1e16e/nihms-780125-f0001.jpg

相似文献

1
EVALUATING RISK-PREDICTION MODELS USING DATA FROM ELECTRONIC HEALTH RECORDS.使用电子健康记录数据评估风险预测模型
Ann Appl Stat. 2016 Mar;10(1):286-304. doi: 10.1214/15-AOAS891.
2
Corrected ROC analysis for misclassified binary outcomes.针对误分类二元结局的校正ROC分析。
Stat Med. 2017 Jun 15;36(13):2148-2160. doi: 10.1002/sim.7260. Epub 2017 Feb 28.
3
Inflation of type I error rates due to differential misclassification in EHR-derived outcomes: Empirical illustration using breast cancer recurrence.由于电子病历衍生结局的差异误分类导致 I 类错误率膨胀:基于乳腺癌复发的实证说明。
Pharmacoepidemiol Drug Saf. 2019 Feb;28(2):264-268. doi: 10.1002/pds.4680. Epub 2018 Oct 30.
4
Risk of 30-Day Hospital Readmission Among Patients Discharged to Skilled Nursing Facilities: Development and Validation of a Risk-Prediction Model.患者出院至护理院 30 天内再入院风险:风险预测模型的建立与验证。
J Am Med Dir Assoc. 2019 Apr;20(4):444-450.e2. doi: 10.1016/j.jamda.2019.01.137. Epub 2019 Mar 7.
5
Statistical inference for association studies using electronic health records: handling both selection bias and outcome misclassification.基于电子健康记录的关联研究的统计推断:处理选择偏倚和结局错误分类。
Biometrics. 2022 Mar;78(1):214-226. doi: 10.1111/biom.13400. Epub 2020 Dec 3.
6
The effect of misclassification on the estimation of association: a review.错误分类对关联估计的影响:综述
Int J Methods Psychiatr Res. 2005;14(2):92-101. doi: 10.1002/mpr.20.
7
Analysis of binary responses with outcome-specific misclassification probability in genome-wide association studies.全基因组关联研究中具有特定结果错误分类概率的二元反应分析。
Appl Clin Genet. 2016 Nov 30;9:169-177. doi: 10.2147/TACG.S122250. eCollection 2016.
8
Predicting 30-day Hospital Readmission with Publicly Available Administrative Database. A Conditional Logistic Regression Modeling Approach.利用公开可用的行政数据库预测30天再入院情况。一种条件逻辑回归建模方法。
Methods Inf Med. 2015;54(6):560-7. doi: 10.3414/ME14-02-0017. Epub 2015 Nov 9.
9
Bias from outcome misclassification in immunization schedule safety research.免疫接种计划安全性研究中结局错误分类导致的偏倚。
Pharmacoepidemiol Drug Saf. 2018 Feb;27(2):221-228. doi: 10.1002/pds.4374. Epub 2018 Jan 2.
10
A primer on quantitative bias analysis with positive predictive values in research using electronic health data.利用电子健康数据进行研究时,关于阳性预测值的定量偏倚分析入门指南。
J Am Med Inform Assoc. 2019 Dec 1;26(12):1664-1674. doi: 10.1093/jamia/ocz094.

引用本文的文献

1
Effect of nirmatrelvir/ritonavir (Paxlovid) on hospitalization among adults with COVID-19: An electronic health record-based target trial emulation from N3C.奈玛特韦/利托那韦(帕罗韦德)对成人新冠病毒感染患者住院率的影响:一项基于电子健康记录的来自N3C的目标试验模拟研究
PLoS Med. 2025 Jan 17;22(1):e1004493. doi: 10.1371/journal.pmed.1004493. eCollection 2025 Jan.
2
Effect of Paxlovid Treatment During Acute Covid-19 on Long Covid Onset: An EHR-Based Target Trial Emulation from the N3C and RECOVER Consortia.急性新冠病毒感染期间使用帕罗韦德治疗对新冠后遗症发病的影响:一项基于电子健康记录的目标试验模拟研究,来自N3C和RECOVER联盟。
medRxiv. 2025 Apr 7:2024.01.20.24301525. doi: 10.1101/2024.01.20.24301525.
3
AutoScore-Ordinal: an interpretable machine learning framework for generating scoring models for ordinal outcomes.AutoScore-Ordinal:一种可解释的机器学习框架,用于生成有序结局的评分模型。
BMC Med Res Methodol. 2022 Nov 4;22(1):286. doi: 10.1186/s12874-022-01770-y.
4
Measurement error and misclassification in electronic medical records: methods to mitigate bias.电子病历中的测量误差和错误分类:减轻偏差的方法。
Curr Epidemiol Rep. 2018 Dec;5(4):343-356. doi: 10.1007/s40471-018-0164-x. Epub 2018 Sep 10.
5
Cox regression is robust to inaccurate EHR-extracted event time: an application to EHR-based GWAS.Cox 回归对 EHR 提取的事件时间不准确具有稳健性:基于 EHR 的 GWAS 的应用。
Bioinformatics. 2022 Apr 12;38(8):2297-2306. doi: 10.1093/bioinformatics/btac086.
6
Prediction models for hospital readmissions in patients with heart disease: a systematic review and meta-analysis.预测心脏病患者住院再入院的模型:系统评价和荟萃分析。
BMJ Open. 2021 Aug 17;11(8):e047576. doi: 10.1136/bmjopen-2020-047576.
7
A Real-time Risk-Prediction Model for Pediatric Venous Thromboembolic Events.儿科静脉血栓栓塞事件实时风险预测模型。
Pediatrics. 2021 Jun;147(6). doi: 10.1542/peds.2020-042325. Epub 2021 May 19.
8
Use of Machine Learning to Develop and Evaluate Models Using Preoperative and Intraoperative Data to Identify Risks of Postoperative Complications.使用机器学习,结合术前和术中数据开发和评估模型,以识别术后并发症的风险。
JAMA Netw Open. 2021 Mar 1;4(3):e212240. doi: 10.1001/jamanetworkopen.2021.2240.
9
Improved generalized raking estimators to address dependent covariate and failure-time outcome error.改进的广义耙式估计器,以解决相关协变量和失效时间结果误差。
Biom J. 2021 Jun;63(5):1006-1027. doi: 10.1002/bimj.202000187. Epub 2021 Mar 11.
10
Disease risk scores for skin cancers.皮肤癌疾病风险评分。
Nat Commun. 2021 Jan 8;12(1):160. doi: 10.1038/s41467-020-20246-5.

本文引用的文献

1
Accounting for misclassified outcomes in binary regression models using multiple imputation with internal validation data.使用内部验证数据进行多重插补来处理二项回归模型中的错误分类结局。
Am J Epidemiol. 2013 May 1;177(9):904-12. doi: 10.1093/aje/kws340. Epub 2013 Apr 4.
2
Net reclassification indices for evaluating risk prediction instruments: a critical review.用于评估风险预测工具的净重新分类指数:批判性评价。
Epidemiology. 2014 Jan;25(1):114-21. doi: 10.1097/EDE.0000000000000018.
3
The readmission risk flag: using the electronic health record to automatically identify patients at risk for 30-day readmission.再入院风险标识:利用电子健康记录自动识别有30天再入院风险的患者。
J Hosp Med. 2013 Dec;8(12):689-95. doi: 10.1002/jhm.2106. Epub 2013 Nov 13.
4
A note on the evaluation of novel biomarkers: do not rely on integrated discrimination improvement and net reclassification index.关于新型生物标志物评估的一则注释:不要依赖综合判别改善和净重新分类指数。
Stat Med. 2014 Aug 30;33(19):3405-14. doi: 10.1002/sim.5804. Epub 2013 Apr 2.
5
Use of health IT for higher-value critical care.利用健康信息技术实现更高价值的重症监护。
N Engl J Med. 2013 Feb 14;368(7):594-7. doi: 10.1056/NEJMp1213273. Epub 2013 Jan 30.
6
Time for a creative transformation of epidemiology in the United States.是时候对美国的流行病学进行创造性变革了。
JAMA. 2012 Nov 7;308(17):1804-5. doi: 10.1001/jama.2012.14838.
7
A unified inference procedure for a class of measures to assess improvement in risk prediction systems with survival data.一类用于评估生存数据中风险预测系统改善的度量的统一推断方法。
Stat Med. 2013 Jun 30;32(14):2430-42. doi: 10.1002/sim.5647. Epub 2012 Oct 5.
8
Evaluating a New Risk Marker's Predictive Contribution in Survival Models.评估一种新的风险标志物在生存模型中的预测贡献。
J Stat Theory Pract. 2010 Dec 1;4(4):845-855. doi: 10.1080/15598608.2010.10412022.
9
Using audit information to adjust parameter estimates for data errors in clinical trials.利用审核信息调整临床试验中数据错误的参数估计。
Clin Trials. 2012 Dec;9(6):721-9. doi: 10.1177/1740774512450100. Epub 2012 Jul 30.
10
Development and evaluation of multi-marker risk scores for clinical prognosis.用于临床预后的多标志物风险评分的开发与评估
Stat Methods Med Res. 2016 Feb;25(1):255-71. doi: 10.1177/0962280212451881. Epub 2012 Jul 5.