• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

缺失数据:一种实践的统计框架。

Missing data: A statistical framework for practice.

机构信息

Department of Medical Statistics, London School of Hygiene & Tropical Medicine, London, UK.

MRC Clinical Trials Unit at UCL, London, UK.

出版信息

Biom J. 2021 Jun;63(5):915-947. doi: 10.1002/bimj.202000196. Epub 2021 Feb 24.

DOI:10.1002/bimj.202000196
PMID:33624862
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7615108/
Abstract

Missing data are ubiquitous in medical research, yet there is still uncertainty over when restricting to the complete records is likely to be acceptable, when more complex methods (e.g. maximum likelihood, multiple imputation and Bayesian methods) should be used, how they relate to each other and the role of sensitivity analysis. This article seeks to address both applied practitioners and researchers interested in a more formal explanation of some of the results. For practitioners, the framework, illustrative examples and code should equip them with a practical approach to address the issues raised by missing data (particularly using multiple imputation), alongside an overview of how the various approaches in the literature relate. In particular, we describe how multiple imputation can be readily used for sensitivity analyses, which are still infrequently performed. For those interested in more formal derivations, we give outline arguments for key results, use simple examples to show how methods relate, and references for full details. The ideas are illustrated with a cohort study, a multi-centre case control study and a randomised clinical trial.

摘要

在医学研究中,缺失数据是普遍存在的,但仍不确定何时限制使用完整记录是可以接受的,何时应该使用更复杂的方法(例如最大似然法、多重插补法和贝叶斯方法),以及它们之间的关系和敏感性分析的作用。本文旨在为对缺失数据(特别是使用多重插补)的一些结果进行更正式解释感兴趣的应用实践者和研究人员提供帮助。对于实践者来说,该框架、说明性示例和代码应该使他们能够采用一种实用的方法来解决缺失数据引起的问题,同时还概述了文献中各种方法的关系。特别是,我们描述了如何使用多重插补进行敏感性分析,而敏感性分析仍然很少进行。对于那些对更正式推导感兴趣的人,我们给出了关键结果的概要论证,使用简单的示例来说明方法之间的关系,并提供了详细信息的参考文献。这些想法通过一项队列研究、一项多中心病例对照研究和一项随机临床试验得到了说明。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5259/7615108/335adf905997/EMS177343-f006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5259/7615108/9d0aa91753ab/EMS177343-f001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5259/7615108/b855942ab476/EMS177343-f002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5259/7615108/b4aa13ff3532/EMS177343-f003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5259/7615108/d09b92b220e1/EMS177343-f004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5259/7615108/e2ca8384442f/EMS177343-f005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5259/7615108/335adf905997/EMS177343-f006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5259/7615108/9d0aa91753ab/EMS177343-f001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5259/7615108/b855942ab476/EMS177343-f002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5259/7615108/b4aa13ff3532/EMS177343-f003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5259/7615108/d09b92b220e1/EMS177343-f004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5259/7615108/e2ca8384442f/EMS177343-f005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5259/7615108/335adf905997/EMS177343-f006.jpg

相似文献

1
Missing data: A statistical framework for practice.缺失数据:一种实践的统计框架。
Biom J. 2021 Jun;63(5):915-947. doi: 10.1002/bimj.202000196. Epub 2021 Feb 24.
2
BAMITA: Bayesian multiple imputation for tensor arrays.BAMITA:张量数组的贝叶斯多重填补法
Biostatistics. 2024 Dec 31;26(1). doi: 10.1093/biostatistics/kxae047.
3
Development of a practical approach to expert elicitation for randomised controlled trials with missing health outcomes: Application to the IMPROVE trial.针对存在健康结局缺失情况的随机对照试验,开发一种实用的专家意见征集方法:在IMPROVE试验中的应用。
Clin Trials. 2017 Aug;14(4):357-367. doi: 10.1177/1740774517711442. Epub 2017 Jul 4.
4
An Approach to Addressing Multiple Imputation Model Uncertainty Using Bayesian Model Averaging.贝叶斯模型平均处理多重插补模型不确定性的方法。
Multivariate Behav Res. 2020 Jul-Aug;55(4):553-567. doi: 10.1080/00273171.2019.1657790. Epub 2019 Sep 20.
5
Multiple imputation of missing data in nested case-control and case-cohort studies.巢式病例对照研究和病例队列研究中缺失数据的多重填补
Biometrics. 2018 Dec;74(4):1438-1449. doi: 10.1111/biom.12910. Epub 2018 Jun 5.
6
Multiple imputation as a flexible tool for missing data handling in clinical research.多元插补作为临床研究中处理缺失数据的灵活工具。
Behav Res Ther. 2017 Nov;98:4-18. doi: 10.1016/j.brat.2016.11.008. Epub 2016 Nov 18.
7
Handling missing data in patient-level cost-effectiveness analysis alongside randomised clinical trials.在患者层面的成本效益分析中与随机临床试验一起处理缺失数据。
Appl Health Econ Health Policy. 2005;4(2):65-75. doi: 10.2165/00148365-200504020-00001.
8
Accounting for bias due to outcome data missing not at random: comparison and illustration of two approaches to probabilistic bias analysis: a simulation study.考虑由于非随机缺失结局数据导致的偏倚:两种概率性偏倚分析方法的比较和说明:一项模拟研究。
BMC Med Res Methodol. 2024 Nov 13;24(1):278. doi: 10.1186/s12874-024-02382-4.
9
When and how should multiple imputation be used for handling missing data in randomised clinical trials - a practical guide with flowcharts.何时以及如何在随机临床试验中使用多重插补来处理缺失数据——附流程图的实用指南。
BMC Med Res Methodol. 2017 Dec 6;17(1):162. doi: 10.1186/s12874-017-0442-1.
10
Multiple imputation for longitudinal data using Bayesian lasso imputation model.基于贝叶斯套索插补模型的纵向数据多重插补方法
Stat Med. 2022 Mar 15;41(6):1042-1058. doi: 10.1002/sim.9315. Epub 2022 Jan 21.

引用本文的文献

1
Missing data in single-cell transcriptomes reveals transcriptional shifts.单细胞转录组中的缺失数据揭示了转录变化。
bioRxiv. 2025 Aug 21:2025.08.15.669765. doi: 10.1101/2025.08.15.669765.
2
Bayesian Analysis of Longitudinal Ordinal Data with Missing Values Using Multivariate Probit Models.使用多元概率单位模型对具有缺失值的纵向有序数据进行贝叶斯分析。
J Stat Appl Probab. 2025 May;14(3):337-352. doi: 10.18576/jsap/140302. Epub 2025 May 1.
3
Development and validation of a diagnostic prediction model for pancreatic ductal adenocarcinoma: VAPOR 1, protocol for a prospective multicentre case-control study.

本文引用的文献

1
Framework for the treatment and reporting of missing data in observational studies: The Treatment And Reporting of Missing data in Observational Studies framework.观察性研究中缺失数据的处理和报告框架:观察性研究中缺失数据的处理和报告框架。
J Clin Epidemiol. 2021 Jun;134:79-88. doi: 10.1016/j.jclinepi.2021.01.008. Epub 2021 Feb 2.
2
Common Methods for Handling Missing Data in Marginal Structural Models: What Works and Why.边缘结构模型中缺失数据处理的常用方法:什么方法有效,为什么有效。
Am J Epidemiol. 2021 Apr 6;190(4):663-672. doi: 10.1093/aje/kwaa225.
3
Sensitivity analysis for clinical trials with missing continuous outcome data using controlled multiple imputation: A practical guide.
胰腺导管腺癌诊断预测模型的开发与验证:VAPOR 1,一项前瞻性多中心病例对照研究方案
BMJ Open. 2025 Aug 27;15(8):e094505. doi: 10.1136/bmjopen-2024-094505.
4
Causal estimation of time-varying treatments in observational studies: a scoping review of methods, applications, and missing data practices.观察性研究中时变治疗的因果估计:方法、应用及缺失数据处理的范围综述
BMC Med Res Methodol. 2025 Aug 27;25(1):202. doi: 10.1186/s12874-025-02633-y.
5
A Bayesian life-course linear structural equations model (BLSEM) to explore the development of body mass index (BMI) from the prenatal stage until middle age.一种贝叶斯生命历程线性结构方程模型(BLSEM),用于探究从产前阶段到中年的体重指数(BMI)发展情况。
Int J Obes (Lond). 2025 Aug 20. doi: 10.1038/s41366-025-01857-8.
6
Bias and Efficiency Comparison between Multiple Imputation and Available-Case Analysis for Missing Data in Longitudinal Models.纵向模型中缺失数据的多重填补与有效病例分析之间的偏差和效率比较
Stat Biosci. 2025 Jun 12. doi: 10.1007/s12561-025-09493-6.
7
Load and Recovery Monitoring in Top-Level Youth Soccer Players: Exploring the Associations of a Web Application-Based Score With Recognized Load Measures.顶级青少年足球运动员的负荷与恢复监测:探究基于网络应用程序得分与公认负荷指标之间的关联
Eur J Sport Sci. 2025 Sep;25(9):e70031. doi: 10.1002/ejsc.70031.
8
Machine learning analysis of greenhouse gas sources impacting Africa's food security nexus.影响非洲粮食安全关系的温室气体来源的机器学习分析
Sci Rep. 2025 Aug 6;15(1):28665. doi: 10.1038/s41598-025-14766-7.
9
Performance of CAC-prob in predicting coronary artery calcium score: an external validation study in a high-CAC burden population.冠状动脉钙化预测概率(CAC-prob)在预测冠状动脉钙化评分方面的表现:一项针对高冠状动脉钙化负荷人群的外部验证研究。
BMC Med Inform Decis Mak. 2025 Aug 4;25(1):288. doi: 10.1186/s12911-025-03128-y.
10
Multiple Imputation of Missing Covariates When Using the Fine-Gray Model.使用Fine-Gray模型时缺失协变量的多重填补
Stat Med. 2025 Jul;44(15-17):e70166. doi: 10.1002/sim.70166.
使用对照多重填补法对具有缺失连续结局数据的临床试验进行敏感性分析:实用指南。
Stat Med. 2020 Sep 20;39(21):2815-2842. doi: 10.1002/sim.8569. Epub 2020 May 17.
4
A framework for extending trial design to facilitate missing data sensitivity analyses.扩展试验设计以促进缺失数据敏感性分析的框架。
BMC Med Res Methodol. 2020 Mar 17;20(1):66. doi: 10.1186/s12874-020-00930-2.
5
Reference-based sensitivity analysis for time-to-event data.基于参考的生存时间数据敏感性分析。
Pharm Stat. 2019 Nov;18(6):645-658. doi: 10.1002/pst.1954. Epub 2019 Jul 15.
6
Multiple imputation for discrete data: Evaluation of the joint latent normal model.离散数据的多重填补:联合潜在正态模型的评估
Biom J. 2019 Jul;61(4):1003-1019. doi: 10.1002/bimj.201800222. Epub 2019 Mar 14.
7
Information-anchored sensitivity analysis: theory and application.信息锚定敏感性分析:理论与应用
J R Stat Soc Ser A Stat Soc. 2019 Feb;182(2):623-645. doi: 10.1111/rssa.12423. Epub 2018 Nov 16.
8
Multiple imputation in Cox regression when there are time-varying effects of covariates.在协变量的时变效应存在时,Cox 回归中的多重插补。
Stat Med. 2018 Nov 10;37(25):3661-3678. doi: 10.1002/sim.7842. Epub 2018 Jul 16.
9
On the use of the not-at-random fully conditional specification (NARFCS) procedure in practice.关于在实践中使用非随机完全条件规范(NARFCS)程序。
Stat Med. 2018 Jul 10;37(15):2338-2353. doi: 10.1002/sim.7643. Epub 2018 Apr 2.
10
Development of a practical approach to expert elicitation for randomised controlled trials with missing health outcomes: Application to the IMPROVE trial.针对存在健康结局缺失情况的随机对照试验,开发一种实用的专家意见征集方法:在IMPROVE试验中的应用。
Clin Trials. 2017 Aug;14(4):357-367. doi: 10.1177/1740774517711442. Epub 2017 Jul 4.