• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
A computationally efficient sequential regression imputation algorithm for multilevel data.一种用于多级数据的计算高效的序贯回归插补算法。
J Appl Stat. 2023 Nov 6;51(11):2258-2278. doi: 10.1080/02664763.2023.2277669. eCollection 2024.
2
Multiple imputation by chained equations for systematically and sporadically missing multilevel data.多水平数据系统缺失和随机缺失的链方程多重插补法。
Stat Methods Med Res. 2018 Jun;27(6):1634-1649. doi: 10.1177/0962280216666564. Epub 2016 Sep 19.
3
Review and evaluation of imputation methods for multivariate longitudinal data with mixed-type incomplete variables.多元纵向混合缺失数据插补方法的评价与研究
Stat Med. 2022 Dec 30;41(30):5844-5876. doi: 10.1002/sim.9592. Epub 2022 Oct 11.
4
Hierarchical imputation of systematically and sporadically missing data: An approximate Bayesian approach using chained equations.系统和偶发性缺失数据的分层插补:一种使用链式方程的近似贝叶斯方法。
Biom J. 2018 Mar;60(2):333-351. doi: 10.1002/bimj.201600220. Epub 2017 Oct 9.
5
SuperMICE: An Ensemble Machine Learning Approach to Multiple Imputation by Chained Equations.超级小鼠:一种基于链式方程的多重填补集成机器学习方法。
Am J Epidemiol. 2022 Feb 19;191(3):516-525. doi: 10.1093/aje/kwab271.
6
Multilevel multiple imputation: A review and evaluation of joint modeling and chained equations imputation.多层次多重插补:联合建模和链式方程插补的综述与评价。
Psychol Methods. 2016 Jun;21(2):222-40. doi: 10.1037/met0000063. Epub 2015 Dec 21.
7
Multiple imputation of multiple multi-item scales when a full imputation model is infeasible.当完全插补模型不可行时对多个多项目量表进行多重插补。
BMC Res Notes. 2016 Jan 26;9:45. doi: 10.1186/s13104-016-1853-5.
8
Multiple imputation with missing data indicators.带有缺失数据指标的多重插补。
Stat Methods Med Res. 2021 Dec;30(12):2685-2700. doi: 10.1177/09622802211047346. Epub 2021 Oct 13.
9
Multiple imputation with sequential penalized regression.多重插补与序贯惩罚回归。
Stat Methods Med Res. 2019 May;28(5):1311-1327. doi: 10.1177/0962280218755574. Epub 2018 Feb 16.
10
The performance of prognostic models depended on the choice of missing value imputation algorithm: a simulation study.预后模型的性能取决于缺失值插补算法的选择:一项模拟研究。
J Clin Epidemiol. 2024 Dec;176:111539. doi: 10.1016/j.jclinepi.2024.111539. Epub 2024 Sep 24.

本文引用的文献

1
A fully conditional specification approach to multilevel imputation of categorical and continuous variables.一种完全条件规范方法,用于对分类变量和连续变量进行多级插补。
Psychol Methods. 2018 Jun;23(2):298-317. doi: 10.1037/met0000148. Epub 2017 May 29.
2
Multiple imputation by chained equations for systematically and sporadically missing multilevel data.多水平数据系统缺失和随机缺失的链方程多重插补法。
Stat Methods Med Res. 2018 Jun;27(6):1634-1649. doi: 10.1177/0962280216666564. Epub 2016 Sep 19.
3
Multiple Imputation for General Missing Data Patterns in the Presence of High-dimensional Data.高维数据存在时一般缺失数据模式的多重填补
Sci Rep. 2016 Feb 12;6:21689. doi: 10.1038/srep21689.
4
Joint modelling rationale for chained equations.联立方程的联合建模原理。
BMC Med Res Methodol. 2014 Feb 21;14:28. doi: 10.1186/1471-2288-14-28.
5
Risk factors for low birth weight in New York state counties.纽约州各县低出生体重的风险因素。
Policy Polit Nurs Pract. 2012 Feb;13(1):17-26. doi: 10.1177/1527154412442391. Epub 2012 May 14.
6
Gaussian-based routines to impute categorical variables in health surveys.基于高斯分布的方法来推断健康调查中的分类变量。
Stat Med. 2011 Dec 20;30(29):3447-60. doi: 10.1002/sim.4355. Epub 2011 Oct 4.
7
Quantifying the impact of fixed effects modeling of clusters in multiple imputation for cluster randomized trials.量化整群随机试验多重填补中整群固定效应建模的影响。
Biom J. 2011 Feb;53(1):57-74. doi: 10.1002/bimj.201000140.
8
Multiple imputation for missing data via sequential regression trees.基于序贯回归树的缺失数据多重插补法。
Am J Epidemiol. 2010 Nov 1;172(9):1070-6. doi: 10.1093/aje/kwq260. Epub 2010 Sep 14.
9
Multiple imputation for missing data: fully conditional specification versus multivariate normal imputation.缺失数据的多重插补:完全条件指定与多元正态插补。
Am J Epidemiol. 2010 Mar 1;171(5):624-32. doi: 10.1093/aje/kwp425. Epub 2010 Jan 27.
10
How many imputations are really needed? Some practical clarifications of multiple imputation theory.究竟需要多少次插补?多重插补理论的一些实际阐释。
Prev Sci. 2007 Sep;8(3):206-13. doi: 10.1007/s11121-007-0070-9. Epub 2007 Jun 5.

一种用于多级数据的计算高效的序贯回归插补算法。

A computationally efficient sequential regression imputation algorithm for multilevel data.

作者信息

Akkaya Hocagil Tugba, Yucel Recai M

机构信息

Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, ON, Canada.

Department of Epidemiology and Biostatistics, Temple University, Philadelphia, PA, USA.

出版信息

J Appl Stat. 2023 Nov 6;51(11):2258-2278. doi: 10.1080/02664763.2023.2277669. eCollection 2024.

DOI:10.1080/02664763.2023.2277669
PMID:39157267
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11328800/
Abstract

Due to the computational burden, especially in high-dimensional settings, sequential imputation may not be practical. In this paper, we adopt computationally advantageous methods by sampling the missing data from their perspective predictive distributions, which leads to significantly improved computation time in the class of variable-by-variable imputation algorithms. We assess the computational performance in a comprehensive simulation study. We then compare and contrast the performance of our algorithm with commonly used alternatives. The results show that our method has a significant advantage over the commonly used alternatives with respect to computational efficiency and inferential quality. Finally, we demonstrate our methods in a substantive problem aimed at investigating the effects of area-level behavioral, socioeconomic, and demographic characteristics on poor birth outcomes in New York State among singleton births.

摘要

由于计算负担,特别是在高维情况下,顺序插补可能不切实际。在本文中,我们通过从其预测分布中对缺失数据进行采样来采用计算上更具优势的方法,这使得逐个变量插补算法类别中的计算时间显著缩短。我们在一项全面的模拟研究中评估计算性能。然后,我们将我们算法的性能与常用的替代方法进行比较和对比。结果表明,在计算效率和推理质量方面,我们的方法比常用的替代方法具有显著优势。最后,我们在一个实质性问题中展示了我们的方法,该问题旨在研究纽约州单胎出生中地区层面的行为、社会经济和人口特征对不良出生结局的影响。