Suppr超能文献

基于两阶段分层样本的模型参数的改进霍维茨 - 汤普森估计:在流行病学中的应用

Improved Horvitz-Thompson Estimation of Model Parameters from Two-phase Stratified Samples: Applications in Epidemiology.

作者信息

Breslow Norman E, Lumley Thomas, Ballantyne Christie M, Chambless Lloyd E, Kulich Michal

机构信息

Department of Biostatistics, University of Washington, Seattle, WA, USA, Tel.: +1-206-543-2035.

出版信息

Stat Biosci. 2009 May 1;1(1):32. doi: 10.1007/s12561-009-9001-6.

Abstract

The case-cohort study involves two-phase sampling: simple random sampling from an infinite super-population at phase one and stratified random sampling from a finite cohort at phase two. Standard analyses of case-cohort data involve solution of inverse probability weighted (IPW) estimating equations, with weights determined by the known phase two sampling fractions. The variance of parameter estimates in (semi)parametric models, including the Cox model, is the sum of two terms: (i) the model based variance of the usual estimates that would be calculated if full data were available for the entire cohort; and (ii) the design based variance from IPW estimation of the unknown cohort total of the efficient influence function (IF) contributions. This second variance component may be reduced by adjusting the sampling weights, either by calibration to known cohort totals of auxiliary variables correlated with the IF contributions or by their estimation using these same auxiliary variables. Both adjustment methods are implemented in the R survey package. We derive the limit laws of coefficients estimated using adjusted weights. The asymptotic results suggest practical methods for construction of auxiliary variables that are evaluated by simulation of case-cohort samples from the National Wilms Tumor Study and by log-linear modeling of case-cohort data from the Atherosclerosis Risk in Communities Study. Although not semiparametric efficient, estimators based on adjusted weights may come close to achieving full efficiency within the class of augmented IPW estimators.

摘要

病例队列研究涉及两阶段抽样

第一阶段从无限超总体中进行简单随机抽样,第二阶段从有限队列中进行分层随机抽样。病例队列数据的标准分析涉及求解逆概率加权(IPW)估计方程,权重由已知的第二阶段抽样比例确定。(半)参数模型(包括Cox模型)中参数估计的方差是两项之和:(i)如果整个队列有完整数据时通常估计的基于模型的方差;(ii)来自有效影响函数(IF)贡献的未知队列总数的IPW估计的基于设计的方差。可以通过调整抽样权重来减少第二个方差分量,要么通过校准与IF贡献相关的辅助变量的已知队列总数,要么通过使用这些相同的辅助变量对其进行估计。这两种调整方法都在R调查包中实现。我们推导了使用调整权重估计的系数的极限定律。渐近结果提出了构建辅助变量的实用方法,这些方法通过对国家肾母细胞瘤研究的病例队列样本进行模拟以及对社区动脉粥样硬化风险研究的病例队列数据进行对数线性建模来评估。尽管不是半参数有效的,但基于调整权重的估计器可能在增强IPW估计器类中接近实现完全效率。

相似文献

7
Z-estimation and stratified samples: application to survival models.Z估计与分层样本:在生存模型中的应用
Lifetime Data Anal. 2015 Oct;21(4):493-516. doi: 10.1007/s10985-014-9317-5. Epub 2015 Jan 15.

引用本文的文献

5
Weight calibration in the joint modelling of medical cost and mortality.医疗费用和死亡率联合建模中的体重校准。
Stat Methods Med Res. 2024 Apr;33(4):728-742. doi: 10.1177/09622802241236935. Epub 2024 Mar 6.
7
Use of nonsteroidal anti-inflammatory drugs and poor olfaction in women.非甾体抗炎药的使用与女性嗅觉差有关。
Int Forum Allergy Rhinol. 2024 Mar;14(3):639-650. doi: 10.1002/alr.23241. Epub 2023 Aug 7.

本文引用的文献

3
Using the whole cohort in the analysis of case-cohort data.在病例队列数据分析中使用整个队列。
Am J Epidemiol. 2009 Jun 1;169(11):1398-405. doi: 10.1093/aje/kwp055. Epub 2009 Apr 8.
7
Exposure stratified case-cohort designs.暴露分层病例队列设计。
Lifetime Data Anal. 2000 Mar;6(1):39-58. doi: 10.1023/a:1009661900674.
8
Analysis of case-cohort designs.病例队列设计分析。
J Clin Epidemiol. 1999 Dec;52(12):1165-72. doi: 10.1016/s0895-4356(99)00102-x.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验