Using full-cohort data in nested case-control and case-cohort studies by multiple imputation.

Author information

Keogh Ruth H, White Ian R

Affiliations

MRC Biostatistics Unit, Cambridge, U.K.; Department of Medical Statistics, London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, U.K.

Publication information

Stat Med. 2013 Oct 15;32(23):4021-43. doi: 10.1002/sim.5818. Epub 2013 Apr 23.

DOI:10.1002/sim.5818

PMID:23613433

Abstract

In many large prospective cohorts, expensive exposure measurements cannot be obtained for all individuals. Exposure-disease association studies are therefore often based on nested case-control or case-cohort studies in which complete information is obtained only for sampled individuals. However, in the full cohort, there may be a large amount of information on cheaply available covariates and possibly a surrogate of the main exposure(s), which typically goes unused. We view the nested case-control or case-cohort study plus the remainder of the cohort as a full-cohort study with missing data. Hence, we propose using multiple imputation (MI) to utilise information in the full cohort when data from the sub-studies are analysed. We use the fully observed data to fit the imputation models. We consider using approximate imputation models and also using rejection sampling to draw imputed values from the true distribution of the missing values given the observed data. Simulation studies show that using MI to utilise full-cohort information in the analysis of nested case-control and case-cohort studies can result in important gains in efficiency, particularly when a surrogate of the main exposure is available in the full cohort. In simulations, this method outperforms counter-matching in nested case-control studies and a weighted analysis for case-cohort studies, both of which use some full-cohort information. Approximate imputation models perform well except when there are interactions or non-linear terms in the outcome model, where imputation using rejection sampling works well.
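The approach described in the abstract can be sketched on toy data. The block below is a minimal illustration under stated assumptions, not the authors' implementation: the cohort is simulated, the variable names (`z`, `x`, `d`, `observed`) are hypothetical, a plain logistic regression stands in for the Cox and conditional-logistic analyses typical of these designs, and the imputation model is an approximate linear-normal model for the exposure given the surrogate and the outcome. It covers only the approximate-imputation route, not rejection sampling.

```python
import numpy as np

rng = np.random.default_rng(0)

def fit_logistic(X, y, n_iter=25):
    """Newton-Raphson logistic regression; returns (coefficients, covariance)."""
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-X @ beta))
        info = X.T @ (X * (p * (1.0 - p))[:, None])
        beta = beta + np.linalg.solve(info, X.T @ (y - p))
    p = 1.0 / (1.0 + np.exp(-X @ beta))
    info = X.T @ (X * (p * (1.0 - p))[:, None])
    return beta, np.linalg.inv(info)

def rubin_pool(estimates, variances):
    """Rubin's rules: pooled estimate and total variance over M imputations."""
    estimates = np.asarray(estimates, dtype=float)
    variances = np.asarray(variances, dtype=float)
    m = len(estimates)
    within = variances.mean()
    between = estimates.var(ddof=1)
    return estimates.mean(), within + (1.0 + 1.0 / m) * between

# Hypothetical toy cohort (all values are assumptions for illustration).
n = 5000
z = rng.normal(size=n)                     # cheap surrogate, known for everyone
x = z + rng.normal(scale=0.5, size=n)      # expensive exposure
d = rng.binomial(1, 1.0 / (1.0 + np.exp(-(-3.0 + 0.5 * x))))  # disease indicator

# Case-cohort style sampling: x is measured for all cases and a 10% random subcohort.
observed = (d == 1) | (rng.random(n) < 0.10)
miss = ~observed

# Approximate imputation model x ~ 1 + z + d, fitted to the fully observed individuals.
A = np.column_stack([np.ones(n), z, d])
A_obs, x_obs = A[observed], x[observed]
gamma_hat, *_ = np.linalg.lstsq(A_obs, x_obs, rcond=None)
resid = x_obs - A_obs @ gamma_hat
df = A_obs.shape[0] - A_obs.shape[1]
XtX_inv = np.linalg.inv(A_obs.T @ A_obs)

M = 10
estimates, variances = [], []
for _ in range(M):
    # Draw imputation-model parameters from their approximate posterior,
    # so that uncertainty in the imputation model is carried through.
    sigma2 = resid @ resid / rng.chisquare(df)
    gamma = rng.multivariate_normal(gamma_hat, sigma2 * XtX_inv)
    x_imp = x.copy()
    x_imp[miss] = A[miss] @ gamma + rng.normal(scale=np.sqrt(sigma2), size=miss.sum())
    # Analyse the completed full cohort (logistic stand-in for the outcome model).
    beta, cov = fit_logistic(np.column_stack([np.ones(n), x_imp]), d)
    estimates.append(beta[1])
    variances.append(cov[1, 1])

pooled_beta, pooled_var = rubin_pool(estimates, variances)
print(f"pooled log odds ratio: {pooled_beta:.3f} (SE {np.sqrt(pooled_var):.3f})")
```

The outcome `d` is included in the imputation model because missingness of the exposure depends on case status by design. With interactions or non-linear terms in the outcome model, the abstract indicates that such an approximate model may not perform well and that rejection sampling is preferable.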

Similar articles

Using full-cohort data in nested case-control and case-cohort studies by multiple imputation.

Stat Med. 2013 Oct 15;32(23):4021-43. doi: 10.1002/sim.5818. Epub 2013 Apr 23.

Multiple imputation of missing data in nested case-control and case-cohort studies.

Biometrics. 2018 Dec;74(4):1438-1449. doi: 10.1111/biom.12910. Epub 2018 Jun 5.

Handling missing data in matched case-control studies using multiple imputation.

Biometrics. 2015 Dec;71(4):1150-9. doi: 10.1111/biom.12358. Epub 2015 Aug 3.

Multiple imputation analysis of case-cohort studies.

Stat Med. 2011 Jun 15;30(13):1595-607. doi: 10.1002/sim.4130. Epub 2011 Feb 24.

The performance of prognostic models depended on the choice of missing value imputation algorithm: a simulation study.

J Clin Epidemiol. 2024 Dec;176:111539. doi: 10.1016/j.jclinepi.2024.111539. Epub 2024 Sep 24.

Evaluation of multiple imputation approaches for handling missing covariate information in a case-cohort study with a binary outcome.

BMC Med Res Methodol. 2022 Apr 3;22(1):87. doi: 10.1186/s12874-021-01495-4.

Fitting additive hazards models for case-cohort studies: a multiple imputation approach.

Stat Med. 2016 Jul 30;35(17):2975-90. doi: 10.1002/sim.6588. Epub 2015 Jul 20.

Nested case-control studies: should one break the matching?

Lifetime Data Anal. 2015 Oct;21(4):517-41. doi: 10.1007/s10985-015-9319-y. Epub 2015 Jan 23.

Missing data and imputation: a practical illustration in a prognostic study on low back pain.

J Manipulative Physiol Ther. 2012 Jul;35(6):464-71. doi: 10.1016/j.jmpt.2012.07.002.

Nonlinear multiple imputation for continuous covariate within semiparametric Cox model: application to HIV data in Senegal.

Stat Med. 2013 Nov 20;32(26):4651-65. doi: 10.1002/sim.5854. Epub 2013 May 28.

Cited by

Importance of Circulating Leptin and Adiponectin in the Causal Pathways Between Obesity and the Development of Colorectal Cancer in Japanese Men.

J Epidemiol. 2024 Dec 5;34(12):563-569. doi: 10.2188/jea.JE20230148. Epub 2024 Sep 30.

On the use of multiple imputation to address data missing by design as well as unintended missing data in case-cohort studies with a binary endpoint.

BMC Med Res Methodol. 2023 Dec 7;23(1):287. doi: 10.1186/s12874-023-02090-5.

Mortality and Morbidity Effects of Long-Term Exposure to Low-Level PM2.5, BC, NO2, and O3: An Analysis of European Cohorts in the ELAPSE Project.

Res Rep Health Eff Inst. 2021 Sep;2021(208):1-127.

Feature screening for case-cohort studies with failure time outcome.

Scand Stat Theory Appl. 2021 Mar;48(1):349-370. doi: 10.1111/sjos.12503. Epub 2020 Nov 16.

Risk Ratio and Risk Difference Estimation in Case-cohort Studies.

J Epidemiol. 2023 Oct 5;33(10):508-513. doi: 10.2188/jea.JE20210509. Epub 2022 Oct 19.

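Evaluation of multiple imputation approaches for handling missing covariate information in a case-cohort study with a binary outcome.

BMC Med Res Methodol. 2022 Apr 3;22(1):87. doi: 10.1186/s12874-021-01495-4.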

Combining multiple imputation with raking of weights: An efficient and robust approach in the setting of nearly true models.

Stat Med. 2021 Dec 30;40(30):6777-6791. doi: 10.1002/sim.9210. Epub 2021 Sep 28.

Common maternal infections during pregnancy and childhood leukaemia in the offspring: findings from six international birth cohorts.

Int J Epidemiol. 2022 Jun 13;51(3):769-777. doi: 10.1093/ije/dyab199.

Conditional screening for ultrahigh-dimensional survival data in case-cohort studies.

Lifetime Data Anal. 2021 Oct;27(4):632-661. doi: 10.1007/s10985-021-09531-7. Epub 2021 Aug 20.

Regression Analysis of Case-cohort Studies in the Presence of Dependent Interval Censoring.

J Appl Stat. 2021;48(5):846-865. doi: 10.1080/02664763.2020.1752633. Epub 2020 Apr 14.
