Suppr超能文献

基于结果依赖抽样设计的数据的次要结局分析。

Secondary outcome analysis for data from an outcome-dependent sampling design.

机构信息

Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.

Epidemiology Branch, National Institute of Environmental Health Sciences, Research Triangle Park, NC, USA.

出版信息

Stat Med. 2018 Jul 10;37(15):2321-2337. doi: 10.1002/sim.7672. Epub 2018 Apr 22.

Abstract

Outcome-dependent sampling (ODS) scheme is a cost-effective way to conduct a study. For a study with continuous primary outcome, an ODS scheme can be implemented where the expensive exposure is only measured on a simple random sample and supplemental samples selected from 2 tails of the primary outcome variable. With the tremendous cost invested in collecting the primary exposure information, investigators often would like to use the available data to study the relationship between a secondary outcome and the obtained exposure variable. This is referred as secondary analysis. Secondary analysis in ODS designs can be tricky, as the ODS sample is not a random sample from the general population. In this article, we use the inverse probability weighted and augmented inverse probability weighted estimating equations to analyze the secondary outcome for data obtained from the ODS design. We do not make any parametric assumptions on the primary and secondary outcome and only specify the form of the regression mean models, thus allow an arbitrary error distribution. Our approach is robust to second- and higher-order moment misspecification. It also leads to more precise estimates of the parameters by effectively using all the available participants. Through simulation studies, we show that the proposed estimator is consistent and asymptotically normal. Data from the Collaborative Perinatal Project are analyzed to illustrate our method.

摘要

基于结果的抽样(ODS)方案是进行研究的一种具有成本效益的方法。对于具有连续主要结局的研究,可以实施 ODS 方案,其中昂贵的暴露仅在简单随机样本上进行测量,并从主要结局变量的 2 个尾部选择补充样本。由于在收集主要暴露信息方面投入了大量成本,研究人员通常希望利用现有数据研究次要结局与获得的暴露变量之间的关系。这被称为二次分析。在 ODS 设计中的二次分析可能很棘手,因为 ODS 样本不是总体人群中的随机样本。在本文中,我们使用逆概率加权和增强逆概率加权估计方程来分析从 ODS 设计中获得的数据的次要结局。我们对主要和次要结局没有任何参数假设,仅指定回归均值模型的形式,因此允许任意误差分布。我们的方法对二阶和更高阶矩的指定不敏感。通过有效利用所有可用的参与者,它还可以更准确地估计参数。通过仿真研究,我们证明了所提出的估计量是一致的和渐近正态的。对合作围产期项目的数据进行了分析,以说明我们的方法。

相似文献

1
Secondary outcome analysis for data from an outcome-dependent sampling design.
Stat Med. 2018 Jul 10;37(15):2321-2337. doi: 10.1002/sim.7672. Epub 2018 Apr 22.
4
Statistical inference for the additive hazards model under outcome-dependent sampling.
Can J Stat. 2015 Sep;43(3):436-453. doi: 10.1002/cjs.11257.
5
Statistical inference for a two-stage outcome-dependent sampling design with a continuous outcome.
Biometrics. 2011 Mar;67(1):194-202. doi: 10.1111/j.1541-0420.2010.01446.x.
6
Best linear inverse probability weighted estimation for two-phase designs and missing covariate regression.
Stat Med. 2019 Jul 10;38(15):2783-2796. doi: 10.1002/sim.8141. Epub 2019 Mar 25.
7
Estimation of a partially linear additive model for data from an outcome-dependent sampling design with a continuous outcome.
Biostatistics. 2016 Oct;17(4):663-76. doi: 10.1093/biostatistics/kxw015. Epub 2016 Mar 22.
8
A two-step semiparametric method to accommodate sampling weights in multiple imputation.
Biometrics. 2016 Mar;72(1):242-52. doi: 10.1111/biom.12413. Epub 2015 Sep 22.
9
Combined estimating equation approaches for semiparametric transformation models with length-biased survival data.
Biometrics. 2014 Sep;70(3):608-18. doi: 10.1111/biom.12170. Epub 2014 Apr 18.
10
Outcome-dependent sampling with interval-censored failure time data.
Biometrics. 2018 Mar;74(1):58-67. doi: 10.1111/biom.12744. Epub 2017 Aug 3.

引用本文的文献

本文引用的文献

1
Semiparametric Estimation in the Secondary Analysis of Case-Control Studies.
J R Stat Soc Series B Stat Methodol. 2016 Jan;78(1):127-151. doi: 10.1111/rssb.12107. Epub 2015 Feb 15.
2
Robust estimation for homoscedastic regression in the secondary analysis of case-control data.
J R Stat Soc Series B Stat Methodol. 2013 Jan 1;75(1):185-206. doi: 10.1111/j.1467-9868.2012.01052.x.
3
Semiparametric inference for a 2-stage outcome-auxiliary-dependent sampling design with continuous outcome.
Biostatistics. 2011 Jul;12(3):521-34. doi: 10.1093/biostatistics/kxq080. Epub 2011 Jan 20.
4
Partial linear inference for a 2-stage outcome-dependent sampling design with a continuous outcome.
Biostatistics. 2011 Jul;12(3):506-20. doi: 10.1093/biostatistics/kxq070. Epub 2010 Dec 14.
6
Maternal serum preconception polychlorinated biphenyl concentrations and infant birth weight.
Environ Health Perspect. 2010 Feb;118(2):297-302. doi: 10.1289/ehp.0901150.
8
Design and inference for cancer biomarker study with an outcome and auxiliary-dependent subsampling.
Biometrics. 2010 Jun;66(2):502-11. doi: 10.1111/j.1541-0420.2009.01280.x. Epub 2009 Jun 9.
9
Genome-wide association scans for secondary traits using case-control samples.
Genet Epidemiol. 2009 Dec;33(8):717-28. doi: 10.1002/gepi.20424.
10
Proper analysis of secondary phenotype data in case-control association studies.
Genet Epidemiol. 2009 Apr;33(3):256-65. doi: 10.1002/gepi.20377.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验