• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通用数据的联合插补

Joint Imputation of General Data.

作者信息

Robbins Michael W

机构信息

Senior Statistician with the RAND Corporation, Pittsburgh, PA 15213, USA.

出版信息

J Surv Stat Methodol. 2023 Sep 12;12(1):183-210. doi: 10.1093/jssam/smad034. eCollection 2024 Feb.

DOI:10.1093/jssam/smad034
PMID:38282960
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10810676/
Abstract

High-dimensional complex survey data of general structures (e.g., containing continuous, binary, categorical, and ordinal variables), such as the US Department of Defense's Health-Related Behaviors Survey (HRBS), often confound procedures designed to impute any missing survey data. Imputation by fully conditional specification (FCS) is often considered the state of the art for such datasets due to its generality and flexibility. However, FCS procedures contain a theoretical flaw that is exposed by HRBS data-HRBS imputations created with FCS are shown to diverge across iterations of Markov Chain Monte Carlo. Imputation by joint modeling lacks this flaw; however, current joint modeling procedures are neither general nor flexible enough to handle HRBS data. As such, we introduce an algorithm that efficiently and flexibly applies multiple imputation by joint modeling in data of general structures. This procedure draws imputations from a latent joint multivariate normal model that underpins the generally structured data and models the latent data via a sequence of conditional linear models, the predictors of which can be specified by the user. We perform rigorous evaluations of HRBS imputations created with the new algorithm and show that they are convergent and of high quality. Lastly, simulations verify that the proposed method performs well compared to existing algorithms including FCS.

摘要

一般结构的高维复杂调查数据(例如,包含连续、二元、分类和有序变量),如美国国防部的健康相关行为调查(HRBS),常常使旨在估算任何缺失调查数据的程序变得复杂。由于其通用性和灵活性,通过完全条件指定(FCS)进行插补通常被认为是处理此类数据集的先进方法。然而,FCS程序存在一个理论缺陷,这一缺陷在HRBS数据中暴露出来——用FCS创建的HRBS插补在马尔可夫链蒙特卡罗的迭代过程中会发散。通过联合建模进行插补不存在这个缺陷;然而,当前的联合建模程序在处理HRBS数据时既不够通用也不够灵活。因此,我们引入了一种算法,该算法能够在一般结构的数据中高效灵活地应用联合建模进行多次插补。此程序从一个潜在的联合多元正态模型中进行插补,该模型支撑着一般结构的数据,并通过一系列条件线性模型对潜在数据进行建模,用户可以指定这些模型的预测变量。我们对用新算法创建的HRBS插补进行了严格评估,结果表明它们是收敛的且质量很高。最后,模拟验证了与包括FCS在内的现有算法相比,所提出的方法表现良好。

相似文献

1
Joint Imputation of General Data.通用数据的联合插补
J Surv Stat Methodol. 2023 Sep 12;12(1):183-210. doi: 10.1093/jssam/smad034. eCollection 2024 Feb.
2
Multiple imputation for discrete data: Evaluation of the joint latent normal model.离散数据的多重填补:联合潜在正态模型的评估
Biom J. 2019 Jul;61(4):1003-1019. doi: 10.1002/bimj.201800222. Epub 2019 Mar 14.
3
Multiple imputation in the presence of an incomplete binary variable created from an underlying continuous variable.在存在由潜在连续变量创建的不完整二元变量的情况下进行多重填补。
Biom J. 2020 Mar;62(2):467-478. doi: 10.1002/bimj.201900011. Epub 2019 Jul 15.
4
Multiple imputation of discrete and continuous data by fully conditional specification.通过完全条件设定对离散和连续数据进行多重填补
Stat Methods Med Res. 2007 Jun;16(3):219-42. doi: 10.1177/0962280206074463.
5
A comparison of multiple imputation methods for handling missing values in longitudinal data in the presence of a time-varying covariate with a non-linear association with time: a simulation study.存在与时间呈非线性关联的时变协变量时,用于处理纵向数据中缺失值的多种多重填补方法的比较:一项模拟研究。
BMC Med Res Methodol. 2017 Jul 25;17(1):114. doi: 10.1186/s12874-017-0372-y.
6
Multiple imputation for missing data: fully conditional specification versus multivariate normal imputation.缺失数据的多重插补:完全条件指定与多元正态插补。
Am J Epidemiol. 2010 Mar 1;171(5):624-32. doi: 10.1093/aje/kwp425. Epub 2010 Jan 27.
7
Multiple Imputation by Fully Conditional Specification for Dealing with Missing Data in a Large Epidemiologic Study.在大型流行病学研究中采用全条件设定多重填补法处理缺失数据
Int J Stat Med Res. 2015;4(3):287-295. doi: 10.6000/1929-6029.2015.04.03.7. Epub 2015 Aug 19.
8
Review and evaluation of imputation methods for multivariate longitudinal data with mixed-type incomplete variables.多元纵向混合缺失数据插补方法的评价与研究
Stat Med. 2022 Dec 30;41(30):5844-5876. doi: 10.1002/sim.9592. Epub 2022 Oct 11.
9
Multiple imputation methods for handling missing values in a longitudinal categorical variable with restrictions on transitions over time: a simulation study.多种插补方法处理具有时间过渡限制的纵向分类变量中的缺失值:一项模拟研究。
BMC Med Res Methodol. 2019 Jan 10;19(1):14. doi: 10.1186/s12874-018-0653-0.
10
Multiple imputation for missing values through conditional Semiparametric odds ratio models.通过条件半参数比值比模型对缺失值进行多重填补。
Biometrics. 2011 Sep;67(3):799-809. doi: 10.1111/j.1541-0420.2010.01538.x. Epub 2011 Jan 6.

引用本文的文献

1
A Real-World Comparison Between Adjuvant Docetaxel with Cyclophosphamide (TC) and Anthracycline-Taxane Chemotherapy in Early HER-2 Negative Breast Cancer.早期HER-2阴性乳腺癌中辅助多西他赛联合环磷酰胺(TC)与蒽环类-紫杉烷类化疗的真实世界比较
Curr Oncol. 2024 Dec 25;32(1):6. doi: 10.3390/curroncol32010006.

本文引用的文献

1
Multiple imputation of missing data in multilevel models with the R package mdmb: a flexible sequential modeling approach.多水平模型中缺失数据的多重插补:使用 R 包 mdmb 的灵活序贯建模方法。
Behav Res Methods. 2021 Dec;53(6):2631-2649. doi: 10.3758/s13428-020-01530-0. Epub 2021 May 23.
2
Multiple imputation in the presence of non-normal data.非正态数据情况下的多重填补
Stat Med. 2017 Feb 20;36(4):606-617. doi: 10.1002/sim.7173. Epub 2016 Nov 15.
3
Analysis of sparse data in logistic regression in medical research: A newer approach.医学研究中逻辑回归稀疏数据的分析:一种新方法。
J Postgrad Med. 2016 Jan-Mar;62(1):26-31. doi: 10.4103/0022-3859.173193.
4
Comparison of random forest and parametric imputation models for imputing missing data using MICE: a CALIBER study.基于 MICE 使用随机森林和参数插补模型比较缺失数据插补:CALIBER 研究。
Am J Epidemiol. 2014 Mar 15;179(6):764-74. doi: 10.1093/aje/kwt312. Epub 2014 Jan 12.
5
Stochastic relaxation, gibbs distributions, and the bayesian restoration of images.随机松弛,吉布斯分布,以及贝叶斯图像恢复。
IEEE Trans Pattern Anal Mach Intell. 1984 Jun;6(6):721-41. doi: 10.1109/tpami.1984.4767596.
6
Combining multiple imputation and inverse-probability weighting.结合多重填补法和逆概率加权法。
Biometrics. 2012 Mar;68(1):129-37. doi: 10.1111/j.1541-0420.2011.01666.x. Epub 2011 Nov 3.
7
Multiple imputation using chained equations: Issues and guidance for practice.使用链式方程进行多重插补:实践中的问题和指导。
Stat Med. 2011 Feb 20;30(4):377-99. doi: 10.1002/sim.4067. Epub 2010 Nov 30.
8
Multiple imputation for missing data via sequential regression trees.基于序贯回归树的缺失数据多重插补法。
Am J Epidemiol. 2010 Nov 1;172(9):1070-6. doi: 10.1093/aje/kwq260. Epub 2010 Sep 14.
9
Multiple imputation for missing data: fully conditional specification versus multivariate normal imputation.缺失数据的多重插补:完全条件指定与多元正态插补。
Am J Epidemiol. 2010 Mar 1;171(5):624-32. doi: 10.1093/aje/kwp425. Epub 2010 Jan 27.
10
Multiple imputation in a large-scale complex survey: a practical guide.大规模复杂调查中的多重插补:实用指南。
Stat Methods Med Res. 2010 Dec;19(6):653-70. doi: 10.1177/0962280208101273. Epub 2009 Aug 4.