Zhou Fan, Zhou Haibo, Li Tengfei, Zhu Hongtu
Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina.
Department of Radiology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina.
Biometrics. 2020 Jun;76(2):606-618. doi: 10.1111/biom.13157. Epub 2019 Nov 11.
Although case-control association studies have been widely used, they are insufficient for many complex diseases, such as Alzheimer's disease and breast cancer, since these diseases may have multiple subtypes with distinct morphologies and clinical implications. Many multigroup studies, such as the Alzheimer's Disease Neuroimaging Initiative (ADNI), have been undertaken by recruiting subjects based on their multiclass primary disease status, while extensive secondary outcomes have been collected. The aim of this paper is to develop a general regression framework for the analysis of secondary phenotypes collected in multigroup association studies. Our regression framework is built on a conditional model for the secondary outcome given the multigroup status and covariates and its relationship with the population regression of interest of the secondary outcome given the covariates. Then, we develop generalized estimation equations to estimate the parameters of interest. We use both simulations and a large-scale imaging genetic data analysis from the ADNI to evaluate the effect of the multigroup sampling scheme on standard genome-wide association analyses based on linear regression methods, while comparing it with our statistical methods that appropriately adjust for the multigroup sampling scheme. Data used in preparation of this article were obtained from the ADNI database.
尽管病例对照关联研究已被广泛应用,但对于许多复杂疾病,如阿尔茨海默病和乳腺癌,它们并不适用,因为这些疾病可能有多种具有不同形态和临床意义的亚型。许多多组研究,如阿尔茨海默病神经影像学倡议(ADNI),通过根据受试者的多类原发性疾病状态招募受试者,并收集了广泛的次要结局。本文的目的是为多组关联研究中收集的次要表型分析开发一个通用回归框架。我们的回归框架基于给定多组状态和协变量时次要结局的条件模型,以及它与给定协变量时次要结局的总体回归兴趣的关系。然后,我们开发广义估计方程来估计感兴趣的参数。我们使用模拟和来自ADNI的大规模成像遗传数据分析来评估多组抽样方案对基于线性回归方法的标准全基因组关联分析的影响,同时将其与我们适当调整多组抽样方案的统计方法进行比较。本文编写过程中使用的数据来自ADNI数据库。