• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

缺失数据的多重插补:完全条件指定与多元正态插补。

Multiple imputation for missing data: fully conditional specification versus multivariate normal imputation.

机构信息

Clinical Epidemiology and Biostatistics Unit, Murdoch Childrens Research Institute, Royal Children's Hospital, Flemington Road, Parkville, Victoria 3052, Australia.

出版信息

Am J Epidemiol. 2010 Mar 1;171(5):624-32. doi: 10.1093/aje/kwp425. Epub 2010 Jan 27.

DOI:10.1093/aje/kwp425
PMID:20106935
Abstract

Statistical analysis in epidemiologic studies is often hindered by missing data, and multiple imputation is increasingly being used to handle this problem. In a simulation study, the authors compared 2 methods for imputation that are widely available in standard software: fully conditional specification (FCS) or "chained equations" and multivariate normal imputation (MVNI). The authors created data sets of 1,000 observations to simulate a cohort study, and missing data were induced under 3 missing-data mechanisms. Imputations were performed using FCS (Royston's "ice") and MVNI (Schafer's NORM) in Stata (Stata Corporation, College Station, Texas), with transformations or prediction matching being used to manage nonnormality in the continuous variables. Inferences for a set of regression parameters were compared between these approaches and a complete-case analysis. As expected, both FCS and MVNI were generally less biased than complete-case analysis, and both produced similar results despite the presence of binary and ordinal variables that clearly did not follow a normal distribution. Ignoring skewness in a continuous covariate led to large biases and poor coverage for the corresponding regression parameter under both approaches, although inferences for other parameters were largely unaffected. These results provide reassurance that similar results can be expected from FCS and MVNI in a standard regression analysis involving variously scaled variables.

摘要

在流行病学研究中,统计分析常常受到缺失数据的阻碍,而多重插补越来越多地被用于处理这个问题。在一项模拟研究中,作者比较了两种广泛应用于标准软件的插补方法:完全条件指定(FCS)或“链式方程”和多元正态插补(MVNI)。作者创建了 1000 个观测值的数据集,以模拟队列研究,并在 3 种缺失数据机制下诱导缺失数据。使用 Stata(Stata Corporation,德克萨斯州College Station)中的 FCS(Royston 的“ice”)和 MVNI(Schafer 的 NORM)进行插补,并使用变换或预测匹配来管理连续变量中的非正态性。在这些方法和完整案例分析之间比较了一组回归参数的推断。正如预期的那样,FCS 和 MVNI 通常比完整案例分析的偏差更小,尽管存在明显不符合正态分布的二进制和有序变量,但两种方法都产生了相似的结果。在这两种方法下,忽略连续协变量的偏度都会导致相应回归参数的大偏差和较差的覆盖率,尽管其他参数的推断基本上不受影响。这些结果提供了保证,即在涉及各种比例变量的标准回归分析中,FCS 和 MVNI 可以产生类似的结果。

相似文献

1
Multiple imputation for missing data: fully conditional specification versus multivariate normal imputation.缺失数据的多重插补:完全条件指定与多元正态插补。
Am J Epidemiol. 2010 Mar 1;171(5):624-32. doi: 10.1093/aje/kwp425. Epub 2010 Jan 27.
2
Multiple imputation for missing data via sequential regression trees.基于序贯回归树的缺失数据多重插补法。
Am J Epidemiol. 2010 Nov 1;172(9):1070-6. doi: 10.1093/aje/kwq260. Epub 2010 Sep 14.
3
[Multiple imputation of missing at random data: General points and presentation of a Monte-Carlo method].[随机缺失数据的多重填补:一般要点及一种蒙特卡罗方法的介绍]
Rev Epidemiol Sante Publique. 2009 Oct;57(5):361-72. doi: 10.1016/j.respe.2009.04.011. Epub 2009 Aug 11.
4
Multiple imputation of discrete and continuous data by fully conditional specification.通过完全条件设定对离散和连续数据进行多重填补
Stat Methods Med Res. 2007 Jun;16(3):219-42. doi: 10.1177/0962280206074463.
5
A comparison of multiple imputation methods for handling missing values in longitudinal data in the presence of a time-varying covariate with a non-linear association with time: a simulation study.存在与时间呈非线性关联的时变协变量时,用于处理纵向数据中缺失值的多种多重填补方法的比较:一项模拟研究。
BMC Med Res Methodol. 2017 Jul 25;17(1):114. doi: 10.1186/s12874-017-0372-y.
6
Missing values in longitudinal dietary data: a multiple imputation approach based on a fully conditional specification.纵向膳食数据中的缺失值:基于完全条件指定的多重插补方法。
Stat Med. 2009 Dec 20;28(29):3657-69. doi: 10.1002/sim.3731.
7
Imputation strategies for missing continuous outcomes in cluster randomized trials.整群随机试验中连续缺失结局的插补策略。
Biom J. 2008 Jun;50(3):329-45. doi: 10.1002/bimj.200710423.
8
Multiple imputation for handling missing outcome data when estimating the relative risk.采用多重插补处理估计相对危险度时丢失的结局数据。
BMC Med Res Methodol. 2017 Sep 6;17(1):134. doi: 10.1186/s12874-017-0414-5.
9
Rounding strategies for multiply imputed binary data.多重填补二元数据的舍入策略。
Biom J. 2009 Aug;51(4):677-88. doi: 10.1002/bimj.200900018.
10
Missing data on the Center for Epidemiologic Studies Depression Scale: a comparison of 4 imputation techniques.流行病学研究中心抑郁量表的缺失数据:4种插补技术的比较
Res Social Adm Pharm. 2007 Mar;3(1):1-27. doi: 10.1016/j.sapharm.2006.04.001.

引用本文的文献

1
Seed quality drives grain yield in Ethiopian and Senegalese sorghum: Insights from machine learning.种子质量决定埃塞俄比亚和塞内加尔高粱的谷物产量:机器学习带来的见解
PLoS One. 2025 Aug 14;20(8):e0329366. doi: 10.1371/journal.pone.0329366. eCollection 2025.
2
Evaluating In-Hospital Arrhythmias in Critically Ill Acute Kidney Injury Patients: Predictive Models, Mortality Risks, and the Efficacy of Antiarrhythmic Drugs.评估危重症急性肾损伤患者的院内心律失常:预测模型、死亡风险及抗心律失常药物的疗效
J Clin Med. 2025 Jun 26;14(13):4552. doi: 10.3390/jcm14134552.
3
Caring for Grandchildren and Dementia Among Older Adults in China.
中国老年人照顾孙辈与痴呆症
JAMA Netw Open. 2025 Jul 1;8(7):e2519622. doi: 10.1001/jamanetworkopen.2025.19622.
4
Difference in long-term care cost obtained with the short-term intensive prevention service (day service type C): A 3-year follow-up study of Japanese older adults.短期强化预防服务(日间服务类型C)所带来的长期护理成本差异:对日本老年人的3年随访研究
Geriatr Gerontol Int. 2025 Aug;25(8):1058-1064. doi: 10.1111/ggi.70102. Epub 2025 Jun 19.
5
High burden of blindness at initial hospitalisation with primary angle-closure glaucoma in a national multicentre study in China.在中国一项全国多中心研究中,原发性闭角型青光眼患者初次住院时失明负担较重。
BMJ Open Ophthalmol. 2025 Jun 5;10(1):e001997. doi: 10.1136/bmjophth-2024-001997.
6
A Multiple Imputation Workflow for Handling Missing Covariate Data in Pharmacometrics Modeling.一种用于处理药代动力学建模中缺失协变量数据的多重填补工作流程。
CPT Pharmacometrics Syst Pharmacol. 2025 Jun;14(6):991-1005. doi: 10.1002/psp4.70039. Epub 2025 May 29.
7
Illicit Substance Use and Harm in Young Adulthood: the Role of Substance Use in Close Relationships and Individual Social Skills.青年期的非法药物使用与危害:亲密关系中药物使用及个人社交技能所起的作用
Int J Ment Health Addict. 2025;23(2):1503-1517. doi: 10.1007/s11469-023-01181-0. Epub 2023 Oct 27.
8
-associated parkinsonism with and without evidence of alpha-synuclein aggregates: longitudinal clinical and biomarker characterization.伴有和不伴有α-突触核蛋白聚集体证据的相关帕金森病:纵向临床和生物标志物特征分析
Brain Commun. 2025 Mar 6;7(2):fcaf103. doi: 10.1093/braincomms/fcaf103. eCollection 2025.
9
Amphetamine use and mental health difficulties across adolescence and young adulthood: An integrative data analysis of four Australasian cohort studies.青少年和青年期使用安非他命与心理健康问题:四项澳大拉西亚队列研究的综合数据分析。
Addiction. 2025 Aug;120(8):1623-1633. doi: 10.1111/add.70033. Epub 2025 Mar 15.
10
Cardiovascular Disease Risk Estimates in the US CKD Population Using the PREVENT Equation.使用PREVENT方程对美国慢性肾脏病患者群体进行心血管疾病风险评估。
Am J Kidney Dis. 2025 Mar 5. doi: 10.1053/j.ajkd.2025.01.012.