处理流行病学研究中缺失值、测量误差和混杂的方法。

Approaches to addressing missing values, measurement error, and confounding in epidemiologic studies.

机构信息

Department of Clinical Epidemiology, Leiden University Medical Center, Leiden, The Netherlands.

出版信息

J Clin Epidemiol. 2021 Mar;131:89-100. doi: 10.1016/j.jclinepi.2020.11.006. Epub 2020 Nov 8.

DOI:10.1016/j.jclinepi.2020.11.006

PMID:33176189

Abstract

OBJECTIVES

Epidemiologic studies often suffer from incomplete data, measurement error (or misclassification), and confounding. Each of these can cause bias and imprecision in estimates of exposure-outcome relations. We describe and compare statistical approaches that aim to control all three sources of bias simultaneously.

STUDY DESIGN AND SETTING

We illustrate four statistical approaches that address all three sources of bias, namely, multiple imputation for missing data and measurement error, multiple imputation combined with regression calibration, full information maximum likelihood within a structural equation modeling framework, and a Bayesian model. In a simulation study, we assess the performance of the four approaches compared with more commonly used approaches that do not account for measurement error, missing values, or confounding.

RESULTS

The results demonstrate that the four approaches consistently outperform the alternative approaches on all performance metrics (bias, mean squared error, and confidence interval coverage). Even in simulated data of 100 subjects, these approaches perform well.

CONCLUSION

There can be a large benefit of addressing measurement error, missing values, and confounding to improve the estimation of exposure-outcome relations, even when the available sample size is relatively small.

摘要

目的

流行病学研究常受到数据不完整、测量误差（或分类错误）和混杂因素的影响。这些因素都会导致暴露-结局关系的估计值产生偏差和不精确。我们描述并比较了旨在同时控制这三种偏倚源的统计方法。

研究设计和设置

我们举例说明了四种可同时解决所有三种偏倚源的统计方法，即：针对缺失数据和测量误差的多重插补、多重插补结合回归校正、结构方程建模框架内的完全信息极大似然法和贝叶斯模型。在一项模拟研究中，我们评估了这四种方法与那些不考虑测量误差、缺失值或混杂因素的常用方法相比的性能。

结果

结果表明，这四种方法在所有性能指标（偏差、均方误差和置信区间覆盖）上均优于替代方法。即使在 100 个受试者的模拟数据中，这些方法也表现良好。

结论

即使可用的样本量相对较小，解决测量误差、缺失值和混杂因素以改善暴露-结局关系的估计也会带来很大的益处。

相似文献

Approaches to addressing missing values, measurement error, and confounding in epidemiologic studies.处理流行病学研究中缺失值、测量误差和混杂的方法。

J Clin Epidemiol. 2021 Mar;131:89-100. doi: 10.1016/j.jclinepi.2020.11.006. Epub 2020 Nov 8.

Dealing with missing covariates in epidemiologic studies: a comparison between multiple imputation and a full Bayesian approach.流行病学研究中处理协变量缺失的问题：多重填补法与全贝叶斯方法的比较

Stat Med. 2016 Jul 30;35(17):2955-74. doi: 10.1002/sim.6944. Epub 2016 Apr 4.

Bayesian correction for covariate measurement error: A frequentist evaluation and comparison with regression calibration.贝叶斯校正协变量测量误差：频率论评价及与回归校正的比较。

Stat Methods Med Res. 2018 Jun;27(6):1695-1708. doi: 10.1177/0962280216667764. Epub 2016 Sep 28.

Multiple imputation with sequential penalized regression.多重插补与序贯惩罚回归。

Stat Methods Med Res. 2019 May;28(5):1311-1327. doi: 10.1177/0962280218755574. Epub 2018 Feb 16.

Multiple Imputation for Incomplete Data in Epidemiologic Studies.在流行病学研究中对不完全数据的多重插补。

Am J Epidemiol. 2018 Mar 1;187(3):576-584. doi: 10.1093/aje/kwx349.

Using Sensitivity Analyses for Unobserved Confounding to Address Covariate Measurement Error in Propensity Score Methods.利用敏感性分析解决未观察到的混杂因素对倾向评分法中协变量测量误差的影响。

Am J Epidemiol. 2018 Mar 1;187(3):604-613. doi: 10.1093/aje/kwx248.

Missing Data in Marginal Structural Models: A Plasmode Simulation Study Comparing Multiple Imputation and Inverse Probability Weighting.边缘结构模型中的缺失数据：比较多种插补和逆概率加权的 Plasmode 模拟研究。

Med Care. 2019 Mar;57(3):237-243. doi: 10.1097/MLR.0000000000001063.

Multiple imputation for handling missing outcome data when estimating the relative risk.采用多重插补处理估计相对危险度时丢失的结局数据。

BMC Med Res Methodol. 2017 Sep 6;17(1):134. doi: 10.1186/s12874-017-0414-5.

Use of multiple imputation in the epidemiologic literature.多重填补法在流行病学文献中的应用。

Am J Epidemiol. 2008 Aug 15;168(4):355-7. doi: 10.1093/aje/kwn071. Epub 2008 Jun 30.

Regression analysis with covariates that have heteroscedastic measurement error.具有异方差测量误差的协变量的回归分析。

Stat Med. 2011 Aug 15;30(18):2278-94. doi: 10.1002/sim.4261. Epub 2011 May 17.

引用本文的文献

Prognostic value of coagulation markers in locally advanced gastric cancer following neoadjuvant immunochemotherapy.新辅助免疫化疗后局部晚期胃癌凝血标志物的预后价值

World J Gastrointest Oncol. 2025 Aug 15;17(8):105099. doi: 10.4251/wjgo.v17.i8.105099.

The burden of cardiovascular disease attributable to dietary risk factors in China, 1990-2021.1990 - 2021年中国归因于饮食风险因素的心血管疾病负担

Sci Rep. 2025 Jul 15;15(1):25641. doi: 10.1038/s41598-025-11645-z.

Applying causal diagrams with measurement error: an outline and further considerations.带有测量误差的因果图应用：概述与进一步思考

Int J Epidemiol. 2025 Apr 12;54(3). doi: 10.1093/ije/dyaf070.

The effects of applying artificial intelligence to triage in the emergency department: A systematic review of prospective studies.人工智能在急诊科分诊中的应用效果：前瞻性研究的系统评价

J Nurs Scholarsh. 2025 Jan;57(1):105-118. doi: 10.1111/jnu.13024. Epub 2024 Sep 11.

Studying the association between longitudinal nondense breast tissue measurements and the risk of breast cancer: a joint modeling approach.研究纵向非致密乳腺组织测量与乳腺癌风险之间的关联：一种联合建模方法。

Am J Epidemiol. 2025 Apr 8;194(4):1065-1071. doi: 10.1093/aje/kwae196.

The robustness of the flow-gradient classification of severe aortic stenosis.重度主动脉瓣狭窄血流梯度分类的稳健性。

JTCVS Open. 2023 Sep 20;16:177-188. doi: 10.1016/j.xjon.2023.08.022. eCollection 2023 Dec.

Measurement error of pulse pressure variation.脉压变异测量误差。

J Clin Monit Comput. 2024 Apr;38(2):313-323. doi: 10.1007/s10877-023-01099-x. Epub 2023 Dec 8.

Data on SARS-CoV-2 events in animals: Mind the gap!关于动物中SARS-CoV-2事件的数据：注意差距！

One Health. 2023 Nov 8;17:100653. doi: 10.1016/j.onehlt.2023.100653. eCollection 2023 Dec.

Health Equity Implications of Missing Data Among Youths With Childhood-Onset Systemic Lupus Erythematosus: A Proof-of-Concept Study in the Childhood Arthritis and Rheumatology Research Alliance Registry.儿童期起病的系统性红斑狼疮青少年中数据缺失的健康公平影响：儿童关节炎和风湿病研究联盟注册研究的概念验证研究。

Arthritis Care Res (Hoboken). 2023 Nov;75(11):2285-2294. doi: 10.1002/acr.25136. Epub 2023 May 30.

Perceived empowerment and the impact of negative effects of the COVID-19 pandemic on the quality of life of persons with severe mental illness.感知赋权与 COVID-19 大流行对严重精神疾病患者生活质量的负面影响。

PLoS One. 2022 Oct 20;17(10):e0276123. doi: 10.1371/journal.pone.0276123. eCollection 2022.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

处理流行病学研究中缺失值、测量误差和混杂的方法。

Approaches to addressing missing values, measurement error, and confounding in epidemiologic studies.

机构信息

出版信息

OBJECTIVES

STUDY DESIGN AND SETTING

RESULTS

CONCLUSION

目的

研究设计和设置

结果

结论

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献