• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

英国预测模型中缺失数据的处理不一致:方法使用回顾。

Missing data was handled inconsistently in UK prediction models: a review of method used.

机构信息

Centre for Health Informatics, Division of Informatics, Imaging and Data Science, Faculty of Biology, Medicine and Health, University of Manchester, Manchester Academic Health Science Centre, Manchester, UK.

Centre for Health Informatics, Division of Informatics, Imaging and Data Science, Faculty of Biology, Medicine and Health, University of Manchester, Manchester Academic Health Science Centre, Manchester, UK.

出版信息

J Clin Epidemiol. 2021 Dec;140:149-158. doi: 10.1016/j.jclinepi.2021.09.008. Epub 2021 Sep 11.

DOI:10.1016/j.jclinepi.2021.09.008
PMID:34520847
Abstract

OBJECTIVES

No clear guidance exists on handling missing data at each stage of developing, validating and implementing a clinical prediction model (CPM). We aimed to review the approaches to handling missing data that underly the CPMs currently recommended for use in UK healthcare.

STUDY DESIGN AND SETTING

A descriptive cross-sectional meta-epidemiological study aiming to identify CPMs recommended by the National Institute for Health and Care Excellence (NICE), which summarized how missing data is handled across their pipelines.

RESULTS

A total of 23 CPMs were included through "sampling strategy." Six missing data strategies were identified: complete case analysis (CCA), multiple imputation, imputation of mean values, k-nearest neighbours imputation, using an additional category for missingness, considering missing values as risk-factor-absent. 52% of the development articles and 48% of the validation articles did not report how missing data were handled. CCA was the most common approach used for development (40%) and validation (44%). At implementation, 57% of the CPMs required complete data entry, whilst 43% allowed missing values. Three CPMs had consistent paths in their pipelines.

CONCLUSION

A broad variety of methods for handling missing data underly the CPMs currently recommended for use in UK healthcare. Missing data handling strategies were generally inconsistent. Better quality assurance of CPMs needs greater clarity and consistency in handling of missing data.

摘要

目的

在开发、验证和实施临床预测模型(CPM)的各个阶段,对于如何处理缺失数据,目前尚无明确的指导意见。我们旨在回顾目前在英国医疗保健中推荐使用的 CPM 所采用的处理缺失数据的方法。

研究设计和设置

本研究为描述性的横断面荟萃流行病学研究,旨在确定英国国家卫生与保健优化研究所(NICE)推荐的 CPM,总结了它们的研究流程中是如何处理缺失数据的。

结果

通过“抽样策略”共纳入了 23 个 CPM。确定了 6 种缺失数据策略:完全病例分析(CCA)、多重插补、均值插补、k 近邻插补、使用缺失的附加类别、将缺失值视为风险因素缺失。在开发文章中,有 52%未报告缺失数据的处理方法,在验证文章中,有 48%未报告缺失数据的处理方法。CCA 是开发(40%)和验证(44%)中最常用的方法。在实施阶段,57%的 CPM 需要完整的数据输入,而 43%的 CPM 允许缺失值。有 3 个 CPM 在其研究流程中具有一致的路径。

结论

目前在英国医疗保健中推荐使用的 CPM 所采用的处理缺失数据的方法多种多样。缺失数据处理策略通常不一致。为了更好地保证 CPM 的质量,需要在处理缺失数据方面更加明确和一致。

相似文献

1
Missing data was handled inconsistently in UK prediction models: a review of method used.英国预测模型中缺失数据的处理不一致:方法使用回顾。
J Clin Epidemiol. 2021 Dec;140:149-158. doi: 10.1016/j.jclinepi.2021.09.008. Epub 2021 Sep 11.
2
Missing data is poorly handled and reported in prediction model studies using machine learning: a literature review.机器学习预测模型研究中对缺失数据的处理和报告很差劲:文献综述。
J Clin Epidemiol. 2022 Feb;142:218-229. doi: 10.1016/j.jclinepi.2021.11.023. Epub 2021 Nov 16.
3
Imputation and missing indicators for handling missing data in the development and deployment of clinical prediction models: A simulation study.处理临床预测模型开发和部署中缺失数据的插补和缺失指标:一项模拟研究。
Stat Methods Med Res. 2023 Aug;32(8):1461-1477. doi: 10.1177/09622802231165001. Epub 2023 Apr 27.
4
Incorporating informatively collected laboratory data from EHR in clinical prediction models.将电子健康记录中信息采集的实验室数据纳入临床预测模型中。
BMC Med Inform Decis Mak. 2024 Jul 24;24(1):206. doi: 10.1186/s12911-024-02612-1.
5
Imputation and Missing Indicators for Handling Missing Longitudinal Data: Data Simulation Analysis Based on Electronic Health Record Data.处理纵向缺失数据的插补与缺失指示符:基于电子健康记录数据的模拟分析
JMIR Med Inform. 2025 Mar 13;13:e64354. doi: 10.2196/64354.
6
Identify the most appropriate imputation method for handling missing values in clinical structured datasets: a systematic review.识别处理临床结构化数据集缺失值的最合适插补方法:系统评价。
BMC Med Res Methodol. 2024 Aug 28;24(1):188. doi: 10.1186/s12874-024-02310-6.
7
The performance of prognostic models depended on the choice of missing value imputation algorithm: a simulation study.预后模型的性能取决于缺失值插补算法的选择:一项模拟研究。
J Clin Epidemiol. 2024 Dec;176:111539. doi: 10.1016/j.jclinepi.2024.111539. Epub 2024 Sep 24.
8
When and how should multiple imputation be used for handling missing data in randomised clinical trials - a practical guide with flowcharts.何时以及如何在随机临床试验中使用多重插补来处理缺失数据——附流程图的实用指南。
BMC Med Res Methodol. 2017 Dec 6;17(1):162. doi: 10.1186/s12874-017-0442-1.
9
Strategies for handling missing data that improve Frailty Index estimation and predictive power: lessons from the NHANES dataset.处理缺失数据的策略可改善衰弱指数的估算和预测能力:来自 NHANES 数据集的经验教训。
Geroscience. 2022 Apr;44(2):897-923. doi: 10.1007/s11357-021-00489-w. Epub 2022 Feb 1.
10
Comparison of methods for handling covariate missingness in propensity score estimation with a binary exposure.比较处理二分类暴露因素倾向性评分估计中协变量缺失的方法。
BMC Med Res Methodol. 2020 Jun 26;20(1):168. doi: 10.1186/s12874-020-01053-4.

引用本文的文献

1
Postpartum depression risk prediction using explainable machine learning algorithms.使用可解释机器学习算法预测产后抑郁风险
Front Med (Lausanne). 2025 Aug 7;12:1565374. doi: 10.3389/fmed.2025.1565374. eCollection 2025.
2
Interpretable Machine Learning for Predicting Anterior Uveitis in Axial Spondyloarthritis.用于预测轴性脊柱关节炎前葡萄膜炎的可解释机器学习
J Clin Rheumatol. 2025 Aug 1;31(5):e42-e48. doi: 10.1097/RHU.0000000000002225. Epub 2025 Apr 25.
3
Decoding uncertainty for clinical decision-making.解读临床决策中的不确定性
Philos Trans A Math Phys Eng Sci. 2025 Mar 13;383(2292):20240207. doi: 10.1098/rsta.2024.0207.
4
Overcoming Missing Data: Accurately Predicting Cardiovascular Risk in Type 2 Diabetes, A Systematic Review.克服数据缺失:2型糖尿病心血管风险的准确预测,一项系统评价
J Diabetes. 2025 Jan;17(1):e70049. doi: 10.1111/1753-0407.70049.
5
Handling missing data and measurement error for early-onset myopia risk prediction models.处理早发性近视风险预测模型中的缺失数据和测量误差。
BMC Med Res Methodol. 2024 Sep 6;24(1):194. doi: 10.1186/s12874-024-02319-x.
6
The impact of different imputation methods on estimates and model performance: an example using a risk prediction model for premature mortality.不同插补方法对估计值和模型性能的影响:使用一个用于预测过早死亡风险的模型的示例。
Popul Health Metr. 2024 Jun 17;22(1):13. doi: 10.1186/s12963-024-00331-3.
7
Prediction of incident chronic kidney disease in community-based electronic health records: a systematic review and meta-analysis.基于社区电子健康记录的新发慢性肾脏病预测:一项系统评价和荟萃分析。
Clin Kidney J. 2024 Apr 18;17(5):sfae098. doi: 10.1093/ckj/sfae098. eCollection 2024 May.
8
Evaluation of clinical prediction models (part 2): how to undertake an external validation study.临床预测模型的评估(第 2 部分):如何进行外部验证研究。
BMJ. 2024 Jan 15;384:e074820. doi: 10.1136/bmj-2023-074820.
9
Making the Improbable Possible: Generalizing Models Designed for a Syndrome-Based, Heterogeneous Patient Landscape.从不可能到可能:推广基于综合征、异质患者群体设计的模型。
Crit Care Clin. 2023 Oct;39(4):751-768. doi: 10.1016/j.ccc.2023.02.003. Epub 2023 Apr 26.
10
Road map for clinicians to develop and evaluate AI predictive models to inform clinical decision-making.为临床医生制定和评估人工智能预测模型以辅助临床决策的路线图。
BMJ Health Care Inform. 2023 Aug;30(1). doi: 10.1136/bmjhci-2023-100784.