Multiple imputation: review of theory, implementation and software.

Authors

Harel Ofer, Zhou Xiao-Hua

Affiliation

Department of Statistics, University of Connecticut, 215 Glenbrook Road Unit 4120 Storrs, CT 06269-4120, USA.

Publication

Stat Med. 2007 Jul 20;26(16):3057-77. doi: 10.1002/sim.2787.

DOI: 10.1002/sim.2787
PMID: 17256804

Abstract

Missing data is a common complication in data analysis. In many medical settings missing data can cause difficulties in estimation, precision and inference. Multiple imputation (MI) (Multiple Imputation for Nonresponse in Surveys. Wiley: New York, 1987) is a simulation-based approach to deal with incomplete data. Although there are many different methods to deal with incomplete data, MI has become one of the leading methods. Since the late 1980s we observed a constant increase in the use and publication of MI-related research. This tutorial does not attempt to cover all the material concerning MI, but rather provides an overview and combines together the theory behind MI, the implementation of MI, and discusses increasing possibilities of the use of MI using commercial and free software. We illustrate some of the major points using an example from an Alzheimer disease (AD) study. In this AD study, while clinical data are available for all subjects, postmortem data are only available for the subset of those who died and underwent an autopsy. Analysis of incomplete data requires making unverifiable assumptions. These assumptions are discussed in detail in the text. Relevant S-Plus code is provided.

Similar articles

1
Multiple imputation: review of theory, implementation and software.
Stat Med. 2007 Jul 20;26(16):3057-77. doi: 10.1002/sim.2787.
2
The use of multiple imputation for the analysis of missing data.
Psychol Methods. 2001 Dec;6(4):317-29.
3
Multiple imputation to correct for partial verification bias revisited.
Stat Med. 2008 Dec 10;27(28):5880-9. doi: 10.1002/sim.3410.
4
Multiple imputation for missing income data in population-based health surveillance.
J Public Health Manag Pract. 2009 Nov-Dec;15(6):E12-21. doi: 10.1097/PHH.0b013e3181aab5f7.
5
Missing data and imputation: a practical illustration in a prognostic study on low back pain.
J Manipulative Physiol Ther. 2012 Jul;35(6):464-71. doi: 10.1016/j.jmpt.2012.07.002.
6
Bias and Precision of the "Multiple Imputation, Then Deletion" Method for Dealing With Missing Outcome Data.
Am J Epidemiol. 2015 Sep 15;182(6):528-34. doi: 10.1093/aje/kwv100. Epub 2015 Sep 2.
7
Evaluation of software for multiple imputation of semi-continuous data.
Stat Methods Med Res. 2007 Jun;16(3):243-58. doi: 10.1177/0962280206074464.
8
Multiple imputation: current perspectives.
Stat Methods Med Res. 2007 Jun;16(3):199-218. doi: 10.1177/0962280206075304.
9
Multiple imputation of missing covariate values in multilevel models with random slopes: a cautionary note.
Behav Res Methods. 2016 Jun;48(2):640-9. doi: 10.3758/s13428-015-0590-3.
10
Advanced statistics: missing data in clinical research--part 2: multiple imputation.
Acad Emerg Med. 2007 Jul;14(7):669-78. doi: 10.1197/j.aem.2006.11.038.

Cited by

1
Bayesian Analysis of Longitudinal Ordinal Data with Missing Values Using Multivariate Probit Models.
J Stat Appl Probab. 2025 May;14(3):337-352. doi: 10.18576/jsap/140302. Epub 2025 May 1.
2
Machine learning-based predictive modeling of angina pectoris in an elderly community-dwelling population: Results from the PoCOsteo study.
PLoS One. 2025 Aug 5;20(8):e0329023. doi: 10.1371/journal.pone.0329023. eCollection 2025.
3
Circulating FGF21 and Ketone Bodies Modify the Risk of MASLD and Mortality: Insights from the PREVEND Cohort Study.
Int J Mol Sci. 2025 May 24;26(11):5059. doi: 10.3390/ijms26115059.
4
CD133 Expression in Circulating Tumor Cells as a Prognostic Marker in Colorectal Cancer.
Int J Mol Sci. 2025 May 15;26(10):4740. doi: 10.3390/ijms26104740.
5
Two-stage multiple imputation with a longitudinal composite variable.
BMC Med Res Methodol. 2025 May 6;25(1):124. doi: 10.1186/s12874-025-02555-9.
6
Novel CT Image-Based Intracerebral Bleeding Risk Score for Patients With Acute Ischemic Stroke Undergoing Thrombolysis.
J Am Heart Assoc. 2025 Feb 18;14(4):e037256. doi: 10.1161/JAHA.124.037256. Epub 2025 Feb 8.
7
Biomarker Panel Development Using Logic Regression in the Presence of Missing Data.
N Engl J Stat Data Sci. 2024 Apr;2(1):3-14. doi: 10.51387/24-nejsds59. Epub 2024 Jan 31.
8
Risk factors and clinical significance of post-stroke incident ischemic lesions.
Alzheimers Dement. 2024 Dec;20(12):8412-8428. doi: 10.1002/alz.14274. Epub 2024 Oct 17.
9
Comparative cardiovascular and renal effectiveness of empagliflozin and dapagliflozin: Scandinavian cohort study.
Eur Heart J Cardiovasc Pharmacother. 2024 Aug 14;10(5):432-443. doi: 10.1093/ehjcvp/pvae045.
10
Systematic Review and Meta-Analysis of Prehospital Machine Learning Scores as Screening Tools for Early Detection of Large Vessel Occlusion in Patients With Suspected Stroke.
J Am Heart Assoc. 2024 Jun 18;13(12):e033298. doi: 10.1161/JAHA.123.033298. Epub 2024 Jun 14.