Melton Phillip E, Peralta Juan M, Almasy Laura
The Curtin/UWA Centre for Genetic Origins of Health and Disease, Faculty of Health Sciences, Curtin University and Faculty of Medicine Dentistry & Health Sciences, The University of Western Australia, Perth, Australia.
South Texas Diabetes and Obesity Institute, University of Texas at Brownsville, Brownsville, TX 78520 USA.
BMC Proc. 2016 Oct 18;10(Suppl 7):329-332. doi: 10.1186/s12919-016-0051-8. eCollection 2016.
The incorporation of longitudinal data into genetic epidemiological studies has the potential to provide valuable information regarding the effect of time on complex disease etiology. Yet, the majority of research focuses on variables collected from a single time point. This aim of this study was to test for main effects on a quantitative trait across time points using a constrained maximum-likelihood measured genotype approach. This method simultaneously accounts for all repeat measurements of a phenotype in families. We applied this method to systolic blood pressure (SBP) measurements from three time points using the Genetic Analysis Workshop 19 (GAW19) whole-genome sequence family simulated data set and 200 simulated replicates. Data consisted of 849 individuals from 20 extended Mexican American pedigrees. Comparisons were made among 3 statistical approaches: (a) constrained, where the effect of a variant or gene region on the mean trait value was constrained to be equal across all measurements; (b) unconstrained, where the variant or gene region effect was estimated separately for each time point; and (c) the average SBP measurement from three time points. These approaches were run for nine genetic variants with known effect sizes (>0.001) for SBP variability and a known gene-centric kernel -based test under the GAW19 simulation model across 200 replicates.
When compared to results using two time points, the constrained method utilizing all 3 time points increased power to detect association. Averaging SBP was equally effective when the variant has a large effect on the phenotype, but less powerful for variants with lower effect sizes. However, averaging SBP was far more effective than either the constrained or unconstrained approaches when using a gene-centric kernel-based test.
We determined that this constrained multivariate approach improves genetic signal over the bivariate method. However, this method is still only effective in those variants that explain a moderate to large proportion of the phenotypic variance but is not as effective for gene-centric tests.
将纵向数据纳入遗传流行病学研究有可能提供关于时间对复杂疾病病因影响的有价值信息。然而,大多数研究集中于从单个时间点收集的变量。本研究的目的是使用受限最大似然测量基因型方法来检验跨时间点对数量性状的主要影响。该方法同时考虑了家庭中表型的所有重复测量。我们使用遗传分析研讨会19(GAW19)全基因组序列家庭模拟数据集和200个模拟重复样本,将此方法应用于三个时间点的收缩压(SBP)测量。数据包括来自20个墨西哥裔美国人扩展家系的849名个体。对三种统计方法进行了比较:(a)受限方法,其中变异或基因区域对平均性状值的影响在所有测量中被约束为相等;(b)非受限方法,其中变异或基因区域的影响在每个时间点分别估计;(c)三个时间点的平均SBP测量值。这些方法针对九个已知效应大小(>0.001)影响SBP变异性的遗传变异以及GAW19模拟模型下基于基因中心核的已知检验,在200个重复样本中运行。
与使用两个时间点的结果相比,利用所有三个时间点的受限方法提高了检测关联的效能。当变异对表型有较大影响时,平均SBP同样有效,但对效应大小较低的变异效能较低。然而,在使用基于基因中心核的检验时,平均SBP比受限或非受限方法都有效得多。
我们确定这种受限多变量方法比双变量方法改善了遗传信号。然而,该方法仍然仅对那些解释中等至大部分表型变异的变异有效,而对基于基因中心的检验效果不佳。