1 Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, USA.
2 Office of Biostatistics Research, National Heart, Lung and Blood Institute, Bethesda, USA.
Stat Methods Med Res. 2018 May;27(5):1464-1475. doi: 10.1177/0962280216662071. Epub 2016 Aug 8.
Genetic association studies often collect information on secondary phenotypes related to the primary disease status. In many situations, the secondary phenotypes are only measured in subjects with the disease condition. It would be advantageous to model the primary trait and the secondary phenotype together if they share certain level of genetic heritability. We propose a family of multi-locus testing procedures to detect the composite association between a set of genetic markers and two traits (the primary trait and a secondary phenotype), in order to identify genes influencing both traits. The proposed test is derived from a random effect model with two variance components, with each presenting the genetic effect on one trait, and incorporates a model selection procedure for seeking the optimal model to represent the two sources of genetic effects. We conduct simulation studies to evaluate performance of the proposed procedure and apply the method to a genome-wide association study of prostate cancer with the Gleason score as the secondary phenotype.
遗传关联研究通常会收集与主要疾病状态相关的次要表型信息。在许多情况下,仅在患有疾病的受试者中测量次要表型。如果主要特征和次要表型具有一定程度的遗传遗传性,则将它们一起建模是有利的。我们提出了一系列多基因座检测程序,以检测一组遗传标记与两种特征(主要特征和次要表型)之间的复合关联,从而确定影响两种特征的基因。该检验源自具有两个方差分量的随机效应模型,每个分量代表一种特征的遗传效应,并包含一个模型选择过程,用于寻找最佳模型来表示两种遗传效应的来源。我们进行了模拟研究,以评估所提出的程序的性能,并将该方法应用于前列腺癌的全基因组关联研究,其中 Gleason 评分作为次要表型。