de Andrade Mariza, Mazo Lopera Mauricio A, Duarte Nubia E
Department of Health Sciences Research, Mayo Clinic, Rochester, MN, 55905, USA.
Escuela de Estadística, Universidad Nacional de Colombia, Medellín, Antioquia, 050022, Colombia.
Stat Appl Genet Mol Biol. 2020 May 5;19(2):sagmb-2019-0030. doi: 10.1515/sagmb-2019-0030.
Genome wide association study (GWAS) is becoming fundamental in the arduous task of deciphering the etiology of complex diseases. The majority of the statistical models used to address the genes-disease association consider a single response variable. However, it is common for certain diseases to have correlated phenotypes such as in cardiovascular diseases. Usually, GWAS typically sample unrelated individuals from a population and the shared familial risk factors are not investigated. In this paper, we propose to apply a bivariate model using family data that associates two phenotypes with a genetic region. Using generalized estimation equations (GEE), we model two phenotypes, either discrete, continuous or a mixture of them, as a function of genetic variables and other important covariates. We incorporate the kinship relationships into the working matrix extended to a bivariate analysis. The estimation method and the joint gene-set effect in both phenotypes are developed in this work. We also evaluate the proposed methodology with a simulation study and an application to real data.
全基因组关联研究(GWAS)在破解复杂疾病病因这一艰巨任务中正变得至关重要。用于处理基因与疾病关联的大多数统计模型都考虑单一反应变量。然而,某些疾病存在相关表型的情况很常见,比如在心血管疾病中。通常,GWAS 通常从人群中抽取无关个体进行抽样,而不研究共享的家族风险因素。在本文中,我们提议应用一种使用家族数据的双变量模型,该模型将两种表型与一个遗传区域相关联。使用广义估计方程(GEE),我们将两种表型(离散型、连续型或它们的混合类型)建模为遗传变量和其他重要协变量的函数。我们将亲属关系纳入扩展到双变量分析的工作矩阵中。本文开发了估计方法以及两种表型中的联合基因集效应。我们还通过模拟研究和对实际数据的应用来评估所提出的方法。