Using the Whole Cohort in the Analysis of Case-Control Data: Application to the Women's Health Initiative.

Authors

Breslow Norman E, Amorim Gustavo, Pettinger Mary B, Rossouw Jacques

Affiliations

Department of Biostatistics, University of Washington, Seattle, WA, USA, Tel.: +1-206-543-1044.

Department of Statistics, University of Auckland, Auckland, NZ.

Publication information

Stat Biosci. 2013 Nov 1;5(2). doi: 10.1007/s12561-013-9080-2.

DOI:10.1007/s12561-013-9080-2

PMID:24363785

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3865808/

Abstract

Standard analyses of data from case-control studies nested in a large cohort ignore information available for cohort members who were not sampled for the sub-study. This paper reviews several methods designed to increase estimation efficiency by using more of the data, treating the case-control sample as a two- or three-phase stratified sample. When the methods were applied to a study of coronary heart disease among women in the hormone trials of the Women's Health Initiative, modest gains in the precision of regression coefficients were observed, and the gains increased with the amount of cohort information used in the analysis. The gains were particularly evident for pseudo-likelihood and maximum likelihood estimates, whose validity depends on the assumed model being correct. Larger standard errors were obtained for coefficients estimated by inverse probability weighted methods, which are more robust to model misspecification. Such misspecification may have been responsible for an important difference in one key regression coefficient between the weighted estimates and those from the more efficient methods.
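To make the inverse probability weighted idea in the abstract concrete, here is a minimal sketch in Python. It is not the authors' code and does not use Women's Health Initiative data. Everything in it is an assumption chosen for illustration: the simulated cohort, the single covariate `x`, the coefficient values, the control sampling probability of 0.1, and the helper `fit_weighted_logistic`. It covers only simple weighting by known sampling probabilities, not the pseudo-likelihood, maximum likelihood, or calibrated-weight methods the paper reviews.

```python
import numpy as np


def fit_weighted_logistic(X, y, w, n_iter=25):
    """Solve the weighted logistic score equations by Newton-Raphson.

    Hypothetical helper for this sketch: X is the design matrix,
    y the 0/1 outcome, w the per-subject weights.
    """
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-(X @ beta)))
        score = X.T @ (w * (y - p))
        info = (X * (w * p * (1.0 - p))[:, None]).T @ X
        beta = beta + np.linalg.solve(info, score)
    return beta


# Phase 1: a simulated cohort (assumed values, not WHI data).
rng = np.random.default_rng(0)
N = 20000
x = rng.normal(size=N)
X = np.column_stack([np.ones(N), x])
true_beta = np.array([-3.0, 0.7])
y = rng.binomial(1, 1.0 / (1.0 + np.exp(-(X @ true_beta))))

# Phase 2: keep every case, sample controls with probability 0.1.
# Bernoulli sampling is an assumption made to keep the sketch short.
pi = np.where(y == 1, 1.0, 0.1)
sampled = rng.random(N) < pi

# Full-cohort fit, which is usually unavailable in practice because
# the expensive covariates are measured only on the sampled subjects.
beta_full = fit_weighted_logistic(X, y, np.ones(N))

# Inverse probability weighted fit: weight each sampled subject by 1/pi.
beta_ipw = fit_weighted_logistic(X[sampled], y[sampled], 1.0 / pi[sampled])

# Unweighted fit to the case-control sample, for comparison.
beta_naive = fit_weighted_logistic(
    X[sampled], y[sampled], np.ones(int(sampled.sum()))
)

print("full cohort:", beta_full)
print("IPW sample: ", beta_ipw)
print("unweighted: ", beta_naive)
```

In this setup the weighted fit targets the same coefficients as the full-cohort fit, while the unweighted fit recovers the slope but shifts the intercept by roughly the log of the ratio of case to control sampling probabilities. The weighted estimates use cohort information only through the sampling probabilities, which is one reason such estimates can have larger standard errors than methods that model more of the cohort data.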

