缺失或错误分类纳入标准的研究中的多重插补方差估计。

Multiple-Imputation Variance Estimation in Studies With Missing or Misclassified Inclusion Criteria.

出版信息

Am J Epidemiol. 2020 Dec 1;189(12):1628-1632. doi: 10.1093/aje/kwaa153.

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7705600/

Abstract

In observational studies using routinely collected data, a variable with a high level of missingness or misclassification may determine whether an observation is included in the analysis. In settings where inclusion criteria are assessed after imputation, the popular multiple-imputation variance estimator proposed by Rubin ("Rubin's rules" (RR)) is biased due to incompatibility between imputation and analysis models. While alternative approaches exist, most analysts are not familiar with them. Using partially validated data from a human immunodeficiency virus cohort, we illustrate the calculation of an imputation variance estimator proposed by Robins and Wang (RW) in a scenario where the study exclusion criteria are based on a variable that must be imputed. In this motivating example, the corresponding imputation variance estimate for the log odds was 29% smaller using the RW estimator than using the RR estimator. We further compared these 2 variance estimators with a simulation study which showed that coverage probabilities of 95% confidence intervals based on the RR estimator were too high and became worse as more observations were imputed and more subjects were excluded from the analysis. The RW imputation variance estimator performed much better and should be employed when there is incompatibility between imputation and analysis models. We provide analysis code to aid future analysts in implementing this method.

摘要

在使用常规收集数据进行观察性研究中，缺失值或分类错误率较高的变量可能会决定观察结果是否纳入分析。在使用插补后评估纳入标准的情况下，由于插补和分析模型之间不兼容，Rubin 提出的流行的多重插补方差估计量（“Rubin 规则”（RR））会产生偏差。虽然存在替代方法，但大多数分析师并不熟悉它们。我们使用人类免疫缺陷病毒队列的部分验证数据，说明了 Robins 和 Wang（RW）提出的插补方差估计量在研究排除标准基于必须插补的变量的情况下的计算。在这个示例中，使用 RW 估计量时，对数优势的相应插补方差估计值比 RR 估计量小 29%。我们进一步将这两种方差估计量与模拟研究进行了比较，结果表明基于 RR 估计量的 95%置信区间的覆盖率概率过高，并且随着更多的观察值被插补以及更多的受试者被排除在分析之外，覆盖率概率变得更差。RW 插补方差估计量的性能要好得多，当插补和分析模型之间存在不兼容时，应使用该方法。我们提供了分析代码，以帮助未来的分析师实施这种方法。

相似文献

Multiple-Imputation Variance Estimation in Studies With Missing or Misclassified Inclusion Criteria.

Am J Epidemiol. 2020 Dec 1;189(12):1628-1632. doi: 10.1093/aje/kwaa153.

Comparison of imputation variance estimators.

Stat Methods Med Res. 2016 Dec;25(6):2541-2557. doi: 10.1177/0962280214526216. Epub 2014 Mar 28.

Propensity score analysis with partially observed covariates: How should multiple imputation be used?

Stat Methods Med Res. 2019 Jan;28(1):3-19. doi: 10.1177/0962280217713032. Epub 2017 Jun 2.

On the multiple imputation variance estimator for control-based and delta-adjusted pattern mixture models.

Biometrics. 2017 Dec;73(4):1379-1387. doi: 10.1111/biom.12702. Epub 2017 Apr 13.

Regression multiple imputation for missing data analysis.

Stat Methods Med Res. 2020 Sep;29(9):2647-2664. doi: 10.1177/0962280220908613. Epub 2020 Mar 4.

Combining multiple imputation and meta-analysis with individual participant data.

Stat Med. 2013 Nov 20;32(26):4499-514. doi: 10.1002/sim.5844. Epub 2013 May 24.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Towards a More Accurate Differential Analysis of Multiple Imputed Proteomics Data with mi4limma.

Methods Mol Biol. 2023;2426:131-140. doi: 10.1007/978-1-0716-1967-4_7.

On inference of control-based imputation for analysis of repeated binary outcomes with missing data.

J Biopharm Stat. 2017;27(3):358-372. doi: 10.1080/10543406.2017.1289957. Epub 2017 Feb 7.

Distributional imputation for the analysis of censored recurrent events.

Stat Med. 2024 Jun 15;43(13):2622-2640. doi: 10.1002/sim.10087. Epub 2024 Apr 29.

引用本文的文献

Metabolomic evaluation of air pollution-related bone damage and potential mediation in Women's Health Initiative participants.

J Bone Miner Res. 2025 Jun 25;40(7):834-846. doi: 10.1093/jbmr/zjaf059.

Associations of preoperative anaemia with healthcare resource use and outcomes after colorectal surgery: a population-based cohort study.

Br J Anaesth. 2024 Jul;133(1):58-66. doi: 10.1016/j.bja.2024.03.018. Epub 2024 Apr 20.

Three-phase generalized raking and multiple imputation estimators to address error-prone data.

Stat Med. 2024 Jan 30;43(2):379-394. doi: 10.1002/sim.9967. Epub 2023 Nov 21.

Target Trial Emulation and Bias Through Missing Eligibility Data: An Application to a Study of Palivizumab for the Prevention of Hospitalization Due to Infant Respiratory Illness.

Am J Epidemiol. 2023 Apr 6;192(4):600-611. doi: 10.1093/aje/kwac202.

Errors in multiple variables in human immunodeficiency virus (HIV) cohort and electronic health record data: statistical challenges and opportunities.

Stat Commun Infect Dis. 2020 Oct 7;12(Suppl1):20190015. doi: 10.1515/scid-2019-0015. eCollection 2020 Sep 1.

Impact of the Changes in the Frequency of Social Participation on All-Cause Mortality in Japanese Older Adults: A Nationwide Longitudinal Study.

Int J Environ Res Public Health. 2021 Dec 27;19(1):270. doi: 10.3390/ijerph19010270.

本文引用的文献

ACCOUNTING FOR DEPENDENT ERRORS IN PREDICTORS AND TIME-TO-EVENT OUTCOMES USING ELECTRONIC HEALTH RECORDS, VALIDATION SAMPLES, AND MULTIPLE IMPUTATION.

Ann Appl Stat. 2020 Jun;14(2):1045-1061. doi: 10.1214/20-aoas1343. Epub 2020 Jun 29.

Considerations for analysis of time-to-event outcomes measured with error: Bias and correction with SIMEX.

Stat Med. 2018 Apr 15;37(8):1276-1289. doi: 10.1002/sim.7554. Epub 2017 Nov 29.

Comparison of imputation variance estimators.

Stat Methods Med Res. 2016 Dec;25(6):2541-2557. doi: 10.1177/0962280214526216. Epub 2014 Mar 28.

Accounting for misclassified outcomes in binary regression models using multiple imputation with internal validation data.

Am J Epidemiol. 2013 May 1;177(9):904-12. doi: 10.1093/aje/kws340. Epub 2013 Apr 4.

Using audit information to adjust parameter estimates for data errors in clinical trials.

Clin Trials. 2012 Dec;9(6):721-9. doi: 10.1177/1740774512450100. Epub 2012 Jul 30.

On the Assessment of Monte Carlo Error in Simulation-Based Statistical Analyses.

Am Stat. 2009 May 1;63(2):155-162. doi: 10.1198/tast.2009.0030.

Multiple-imputation for measurement-error correction.

Int J Epidemiol. 2006 Aug;35(4):1074-81. doi: 10.1093/ije/dyl097. Epub 2006 May 18.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

缺失或错误分类纳入标准的研究中的多重插补方差估计。

Multiple-Imputation Variance Estimation in Studies With Missing or Misclassified Inclusion Criteria.

出版信息

Am J Epidemiol. 2020 Dec 1;189(12):1628-1632. doi: 10.1093/aje/kwaa153.

DOI:10.1093/aje/kwaa153

PMID:32685964

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7705600/

Abstract

摘要

缺失或错误分类纳入标准的研究中的多重插补方差估计。

Multiple-Imputation Variance Estimation in Studies With Missing or Misclassified Inclusion Criteria.

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

缺失或错误分类纳入标准的研究中的多重插补方差估计。

Multiple-Imputation Variance Estimation in Studies With Missing or Misclassified Inclusion Criteria.

出版信息