• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

辅助变量选择策略在多重插补中的比较。

A comparison of strategies for selecting auxiliary variables for multiple imputation.

机构信息

Clinical Epidemiology and Biostatistics Unit, Murdoch Children's Research Institute, Parkville, Victoria, Australia.

Department of Paediatrics, The University of Melbourne, Parkville, Victoria, Australia.

出版信息

Biom J. 2024 Jan;66(1):e2200291. doi: 10.1002/bimj.202200291.

DOI:10.1002/bimj.202200291
PMID:38285405
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7615727/
Abstract

Multiple imputation (MI) is a popular method for handling missing data. Auxiliary variables can be added to the imputation model(s) to improve MI estimates. However, the choice of which auxiliary variables to include is not always straightforward. Several data-driven auxiliary variable selection strategies have been proposed, but there has been limited evaluation of their performance. Using a simulation study we evaluated the performance of eight auxiliary variable selection strategies: (1, 2) two versions of selection based on correlations in the observed data; (3) selection using hypothesis tests of the "missing completely at random" assumption; (4) replacing auxiliary variables with their principal components; (5, 6) forward and forward stepwise selection; (7) forward selection based on the estimated fraction of missing information; and (8) selection via the least absolute shrinkage and selection operator (LASSO). A complete case analysis and an MI analysis using all auxiliary variables (the "full model") were included for comparison. We also applied all strategies to a motivating case study. The full model outperformed all auxiliary variable selection strategies in the simulation study, with the LASSO strategy the best performing auxiliary variable selection strategy overall. All MI analysis strategies that we were able to apply to the case study led to similar estimates, although computational time was substantially reduced when variable selection was employed. This study provides further support for adopting an inclusive auxiliary variable strategy where possible. Auxiliary variable selection using the LASSO may be a promising alternative when the full model fails or is too burdensome.

摘要

多重插补(MI)是处理缺失数据的一种常用方法。可以向插补模型中添加辅助变量以提高 MI 估计值。但是,选择包含哪些辅助变量并不总是那么简单。已经提出了几种基于数据的辅助变量选择策略,但对其性能的评估有限。我们使用模拟研究评估了八种辅助变量选择策略的性能:(1)两种基于观测数据相关性的选择版本;(2)基于“完全随机缺失”假设的假设检验的选择;(3)用辅助变量的主成分替换辅助变量;(4)向前和逐步向前选择;(5)基于估计缺失信息量的分数的向前选择;(6)基于最小绝对收缩和选择算子(LASSO)的选择。为了进行比较,还包括完全案例分析和使用所有辅助变量的 MI 分析(“完整模型”)。我们还将所有策略应用于一个动机案例研究。在模拟研究中,完整模型的表现优于所有辅助变量选择策略,LASSO 策略是整体表现最好的辅助变量选择策略。我们能够应用于案例研究的所有 MI 分析策略都导致了相似的估计值,尽管当使用变量选择时计算时间大大减少。这项研究进一步支持在可能的情况下采用包容性辅助变量策略。当完整模型失败或过于繁琐时,使用 LASSO 的辅助变量选择可能是一种有前途的替代方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcb5/10952544/7d1373dcc3d3/BIMJ-66-0-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcb5/10952544/a967328fd4f9/BIMJ-66-0-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcb5/10952544/649c1146a975/BIMJ-66-0-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcb5/10952544/5725299034d5/BIMJ-66-0-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcb5/10952544/7d1373dcc3d3/BIMJ-66-0-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcb5/10952544/a967328fd4f9/BIMJ-66-0-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcb5/10952544/649c1146a975/BIMJ-66-0-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcb5/10952544/5725299034d5/BIMJ-66-0-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcb5/10952544/7d1373dcc3d3/BIMJ-66-0-g001.jpg

相似文献

1
A comparison of strategies for selecting auxiliary variables for multiple imputation.辅助变量选择策略在多重插补中的比较。
Biom J. 2024 Jan;66(1):e2200291. doi: 10.1002/bimj.202200291.
2
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状荟萃分析。
Cochrane Database Syst Rev. 2017 Dec 22;12(12):CD011535. doi: 10.1002/14651858.CD011535.pub2.
3
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状Meta分析。
Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.
4
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
5
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
6
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
7
Behavioral interventions to reduce risk for sexual transmission of HIV among men who have sex with men.降低男男性行为者中艾滋病毒性传播风险的行为干预措施。
Cochrane Database Syst Rev. 2008 Jul 16(3):CD001230. doi: 10.1002/14651858.CD001230.pub2.
8
Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施:系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。
Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.
9
[Volume and health outcomes: evidence from systematic reviews and from evaluation of Italian hospital data].[容量与健康结果:来自系统评价和意大利医院数据评估的证据]
Epidemiol Prev. 2013 Mar-Jun;37(2-3 Suppl 2):1-100.
10
Clinical symptoms, signs and tests for identification of impending and current water-loss dehydration in older people.老年人即将发生和当前失水脱水的识别的临床症状、体征及检查
Cochrane Database Syst Rev. 2015 Apr 30;2015(4):CD009647. doi: 10.1002/14651858.CD009647.pub2.

引用本文的文献

1
Human Papillomavirus Positivity and Cognitive Function in Older U.S. Adults: A Cross-Sectional Population-Based Study.美国老年成年人的人乳头瘤病毒阳性与认知功能:一项基于人群的横断面研究。
Pathogens. 2025 May 21;14(5):508. doi: 10.3390/pathogens14050508.
2
Acceptability and Efficacy of a Web-Based, Intuitive Eating-Focused Single Session Intervention for Recurrent Binge Eating: A Randomized Controlled Trial.一项针对复发性暴饮暴食的基于网络的、以直觉饮食为重点的单次干预的可接受性和有效性:一项随机对照试验。
Int J Eat Disord. 2025 Aug;58(8):1547-1557. doi: 10.1002/eat.24466. Epub 2025 May 15.

本文引用的文献

1
A comparison of multiple imputation strategies for handling missing data in multi-item scales: Guidance for longitudinal studies.多项目量表中缺失数据处理的多种插补策略比较:对纵向研究的指导。
Stat Med. 2021 Sep 20;40(21):4660-4674. doi: 10.1002/sim.9088. Epub 2021 Jun 8.
2
Practical strategies for handling breakdown of multiple imputation procedures.处理多重填补程序故障的实用策略。
Emerg Themes Epidemiol. 2021 Apr 1;18(1):5. doi: 10.1186/s12982-021-00095-3.
3
Multiple imputation methods for handling missing values in longitudinal studies with sampling weights: Comparison of methods implemented in Stata.
多重插补方法处理纵向研究中带有抽样权重的缺失值:Stata 中实现方法的比较。
Biom J. 2021 Feb;63(2):354-371. doi: 10.1002/bimj.201900360. Epub 2020 Oct 25.
4
The Sisters' Advantage? Broader Autism Phenotype Characteristics and Young Adults' Sibling Support.姐妹的优势?更广泛的自闭症特征和年轻人的兄弟姐妹支持。
J Autism Dev Disord. 2019 Oct;49(10):4256-4267. doi: 10.1007/s10803-019-04139-1.
5
Autonomy-related Parenting Processes and Adolescent Adjustment in Latinx Immigrant Families.自主相关的育儿过程与拉丁裔移民家庭青少年的适应
J Youth Adolesc. 2019 Jun;48(6):1161-1174. doi: 10.1007/s10964-019-01010-5. Epub 2019 Mar 7.
6
Using simulation studies to evaluate statistical methods.运用模拟研究评估统计方法。
Stat Med. 2019 May 20;38(11):2074-2102. doi: 10.1002/sim.8086. Epub 2019 Jan 16.
7
Canonical Causal Diagrams to Guide the Treatment of Missing Data in Epidemiologic Studies.规范因果图指导流行病学研究中缺失数据的处理。
Am J Epidemiol. 2018 Dec 1;187(12):2705-2715. doi: 10.1093/aje/kwy173.
8
Parents' Social Comparisons of Siblings and Youth Problem Behavior: A Moderated Mediation Model.父母对兄弟姐妹的社会比较与青少年问题行为:一个有调节的中介模型。
J Youth Adolesc. 2018 Oct;47(10):2088-2099. doi: 10.1007/s10964-018-0865-y. Epub 2018 Jun 18.
9
The Intersection of Emotional and Sociocognitive Competencies with Civic Engagement in Middle Childhood and Adolescence.情绪和社会认知能力与青少年中期公民参与的交叉。
J Youth Adolesc. 2018 Aug;47(8):1663-1683. doi: 10.1007/s10964-018-0842-5. Epub 2018 Mar 23.
10
How does School Experience Relate to Adolescent Identity Formation Over Time? Cross-Lagged Associations between School Engagement, School Burnout and Identity Processing Styles.学校经历如何随时间影响青少年身份形成?学校参与、学校倦怠与身份处理方式的交叉滞后关联。
J Youth Adolesc. 2018 Apr;47(4):760-774. doi: 10.1007/s10964-017-0806-1. Epub 2018 Jan 12.