• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

平衡效能与计算负担:可靠量表中项目无应答的加权均值、多重填补和逆概率加权法

Balancing efficacy and computational burden: weighted mean, multiple imputation, and inverse probability weighting methods for item non-response in reliable scales.

作者信息

Guide Andrew, Garbett Shawn, Feng Xiaoke, Mapes Brandy M, Cook Justin, Sulieman Lina, Cronin Robert M, Chen Qingxia

机构信息

Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN 37203-2158, United States.

Vanderbilt Institute for Clinical and Translational Research, Vanderbilt University Medical Center, Nashville, TN 37203-2158, United States.

出版信息

J Am Med Inform Assoc. 2024 Dec 1;31(12):2869-2879. doi: 10.1093/jamia/ocae217.

DOI:10.1093/jamia/ocae217
PMID:39138951
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11631082/
Abstract

IMPORTANCE

Scales often arise from multi-item questionnaires, yet commonly face item non-response. Traditional solutions use weighted mean (WMean) from available responses, but potentially overlook missing data intricacies. Advanced methods like multiple imputation (MI) address broader missing data, but demand increased computational resources. Researchers frequently use survey data in the All of Us Research Program (All of Us), and it is imperative to determine if the increased computational burden of employing MI to handle non-response is justifiable.

OBJECTIVES

Using the 5-item Physical Activity Neighborhood Environment Scale (PANES) in All of Us, this study assessed the tradeoff between efficacy and computational demands of WMean, MI, and inverse probability weighting (IPW) when dealing with item non-response.

MATERIALS AND METHODS

Synthetic missingness, allowing 1 or more item non-response, was introduced into PANES across 3 missing mechanisms and various missing percentages (10%-50%). Each scenario compared WMean of complete questions, MI, and IPW on bias, variability, coverage probability, and computation time.

RESULTS

All methods showed minimal biases (all <5.5%) for good internal consistency, with WMean suffered most with poor consistency. IPW showed considerable variability with increasing missing percentage. MI required significantly more computational resources, taking >8000 and >100 times longer than WMean and IPW in full data analysis, respectively.

DISCUSSION AND CONCLUSION

The marginal performance advantages of MI for item non-response in highly reliable scales do not warrant its escalated cloud computational burden in All of Us, particularly when coupled with computationally demanding post-imputation analyses. Researchers using survey scales with low missingness could utilize WMean to reduce computing burden.

摘要

重要性

量表通常源自多项目问卷,但常常面临项目无应答的情况。传统方法使用可用应答的加权均值(WMean),但可能忽略了缺失数据的复杂性。诸如多重填补(MI)等先进方法可处理更广泛的缺失数据,但需要更多的计算资源。研究人员经常在“我们所有人”研究计划(All of Us)中使用调查数据,因此必须确定采用MI来处理无应答所增加的计算负担是否合理。

目的

本研究使用“我们所有人”研究计划中的5项身体活动邻里环境量表(PANES),评估了在处理项目无应答时,WMean、MI和逆概率加权(IPW)在功效和计算需求之间的权衡。

材料与方法

通过3种缺失机制和不同的缺失百分比(10%-50%),将允许1个或多个项目无应答的合成缺失引入到PANES中。每个场景都比较了完整问题的WMean、MI和IPW在偏差、变异性、覆盖概率和计算时间方面的表现。

结果

在内部一致性良好的情况下,所有方法的偏差均最小(均<5.5%),其中WMean在一致性较差时受影响最大。随着缺失百分比的增加,IPW显示出相当大的变异性。MI需要显著更多的计算资源,在完整数据分析中分别比WMean和IPW长8000多倍和100多倍。

讨论与结论

对于高度可靠量表中的项目无应答,MI的边际性能优势并不能证明其在“我们所有人”研究计划中增加的云计算负担是合理的,特别是当与计算要求高的插补后分析相结合时。使用缺失率低的调查量表的研究人员可以使用WMean来减轻计算负担。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4724/11631082/8b06835939f7/ocae217f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4724/11631082/4e9f1ab57e47/ocae217f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4724/11631082/b5e50b603607/ocae217f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4724/11631082/93899f31d5a4/ocae217f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4724/11631082/8b06835939f7/ocae217f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4724/11631082/4e9f1ab57e47/ocae217f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4724/11631082/b5e50b603607/ocae217f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4724/11631082/93899f31d5a4/ocae217f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4724/11631082/8b06835939f7/ocae217f4.jpg

相似文献

1
Balancing efficacy and computational burden: weighted mean, multiple imputation, and inverse probability weighting methods for item non-response in reliable scales.平衡效能与计算负担:可靠量表中项目无应答的加权均值、多重填补和逆概率加权法
J Am Med Inform Assoc. 2024 Dec 1;31(12):2869-2879. doi: 10.1093/jamia/ocae217.
2
On the use of multiple imputation to address data missing by design as well as unintended missing data in case-cohort studies with a binary endpoint.关于在以二分类结局为研究终点的病例-队列研究中,针对设计缺失和非故意缺失数据,采用多重填补方法进行处理。
BMC Med Res Methodol. 2023 Dec 7;23(1):287. doi: 10.1186/s12874-023-02090-5.
3
Evaluation of multiple imputation approaches for handling missing covariate information in a case-cohort study with a binary outcome.评价在二分类结局病例-对照研究中采用多种插补方法处理协变量缺失信息的效果。
BMC Med Res Methodol. 2022 Apr 3;22(1):87. doi: 10.1186/s12874-021-01495-4.
4
Missing Data in Marginal Structural Models: A Plasmode Simulation Study Comparing Multiple Imputation and Inverse Probability Weighting.边缘结构模型中的缺失数据:比较多种插补和逆概率加权的 Plasmode 模拟研究。
Med Care. 2019 Mar;57(3):237-243. doi: 10.1097/MLR.0000000000001063.
5
Evaluation of predictive model performance of an existing model in the presence of missing data.评估存在缺失数据时现有模型的预测模型性能。
Stat Med. 2021 Jul 10;40(15):3477-3498. doi: 10.1002/sim.8978. Epub 2021 Apr 11.
6
Review of inverse probability weighting for dealing with missing data.逆概率加权法处理缺失数据的综述。
Stat Methods Med Res. 2013 Jun;22(3):278-95. doi: 10.1177/0962280210395740. Epub 2011 Jan 10.
7
Dealing with indeterminate outcomes in antimalarial drug efficacy trials: a comparison between complete case analysis, multiple imputation and inverse probability weighting.处理抗疟药物疗效试验中的不确定结局:完全病例分析、多重插补和逆概率加权法的比较。
BMC Med Res Methodol. 2019 Nov 27;19(1):215. doi: 10.1186/s12874-019-0856-z.
8
Combining multiple imputation and inverse-probability weighting.结合多重填补法和逆概率加权法。
Biometrics. 2012 Mar;68(1):129-37. doi: 10.1111/j.1541-0420.2011.01666.x. Epub 2011 Nov 3.
9
Missing data in a multi-item instrument were best handled by multiple imputation at the item score level.多项目量表中的缺失数据最好在项目得分层面采用多重插补处理。
J Clin Epidemiol. 2014 Mar;67(3):335-42. doi: 10.1016/j.jclinepi.2013.09.009. Epub 2013 Dec 2.
10
Propensity score analysis with partially observed covariates: How should multiple imputation be used?倾向评分分析与部分观测协变量:应如何使用多重插补?
Stat Methods Med Res. 2019 Jan;28(1):3-19. doi: 10.1177/0962280217713032. Epub 2017 Jun 2.

引用本文的文献

1
Returning value to communities from the All of Us Research Program through innovative approaches for data use, analysis, dissemination, and research capacity building.通过创新的数据使用、分析、传播和研究能力建设方法,将价值回馈给“我们所有人”研究计划中的各个社区。
J Am Med Inform Assoc. 2024 Dec 1;31(12):2773-2780. doi: 10.1093/jamia/ocae276.

本文引用的文献

1
Measuring social determinants of health in the All of Us Research Program.测量“全民研究计划”中的健康社会决定因素。
Sci Rep. 2024 Apr 16;14(1):8815. doi: 10.1038/s41598-024-57410-6.
2
The Data and Research Center: Creating a Secure, Scalable, and Sustainable Ecosystem for Biomedical Research.数据与研究中心:为生物医学研究创建安全、可扩展和可持续的生态系统。
Annu Rev Biomed Data Sci. 2023 Aug 10;6:443-464. doi: 10.1146/annurev-biodatasci-122120-104825.
3
Standard multiple imputation of survey data didn't perform better than simple substitution in enhancing an administrative dataset: the example of self-rated health in England.
在增强行政数据集方面,调查数据的标准多重填补并不比简单替换表现得更好:以英格兰的自评健康为例。
Emerg Themes Epidemiol. 2021 Jul 24;18(1):9. doi: 10.1186/s12982-021-00099-z.
4
Diversity and inclusion for the All of Us research program: A scoping review.《全民研究计划的多样性和包容性:范围综述》。
PLoS One. 2020 Jul 1;15(7):e0234962. doi: 10.1371/journal.pone.0234962. eCollection 2020.
5
The "All of Us" Research Program.“All of Us”研究计划。
N Engl J Med. 2019 Aug 15;381(7):668-676. doi: 10.1056/NEJMsr1809937.
6
Development of the Initial Surveys for the All of Us Research Program.全美国研究计划初始调查问卷的制定。
Epidemiology. 2019 Jul;30(4):597-608. doi: 10.1097/EDE.0000000000001028.
7
Accounting for missing data in statistical analyses: multiple imputation is not always the answer.在统计分析中处理缺失数据:多重插补并不总是答案。
Int J Epidemiol. 2019 Aug 1;48(4):1294-1304. doi: 10.1093/ije/dyz032.
8
Correction to: multiple imputation for patient reported outcome measures in randomised controlled trials: advantages and disadvantages of imputing at the item, subscale or composite score level.对以下内容的更正:随机对照试验中患者报告结局指标的多重填补:在条目、子量表或综合得分水平进行填补的优缺点
BMC Med Res Methodol. 2018 Oct 16;18(1):107. doi: 10.1186/s12874-018-0563-1.
9
Multiple imputation for patient reported outcome measures in randomised controlled trials: advantages and disadvantages of imputing at the item, subscale or composite score level.在随机对照试验中对患者报告结局测量进行多重插补:在项目、分量表或综合评分层面插补的优缺点。
BMC Med Res Methodol. 2018 Aug 28;18(1):87. doi: 10.1186/s12874-018-0542-6.
10
Should multiple imputation be the method of choice for handling missing data in randomized trials?在随机试验中,处理缺失数据时是否应选择多重插补法?
Stat Methods Med Res. 2018 Sep;27(9):2610-2626. doi: 10.1177/0962280216683570. Epub 2016 Dec 19.