Hou Ting-Ting, Lin Feng, Bai Shasha, Cleves Mario A, Xu Hai-Ming, Lou Xiang-Yang
Biostatistics Program, Department of Pediatrics, University of Arkansas for Medical Sciences, Little Rock, Arkansas.
Arkansas Children's Research Institute, Little Rock, Arkansas.
Genet Epidemiol. 2019 Feb;43(1):24-36. doi: 10.1002/gepi.22169. Epub 2018 Nov 2.
The manifestation of complex traits is influenced by gene-gene and gene-environment interactions, and the identification of multifactor interactions is an important but challenging undertaking for genetic studies. Many complex phenotypes such as disease severity are measured on an ordinal scale with more than two categories. A proportional odds model can improve statistical power for these outcomes, when compared to a logit model either collapsing the categories into two mutually exclusive groups or limiting the analysis to pairs of categories. In this study, we propose a proportional odds model-based generalized multifactor dimensionality reduction (GMDR) method for detection of interactions underlying polytomous ordinal phenotypes. Computer simulations demonstrated that this new GMDR method has a higher power and more accurate predictive ability than the GMDR methods based on a logit model and a multinomial logit model. We applied this new method to the genetic analysis of low-density lipoprotein (LDL) cholesterol, a causal risk factor for coronary artery disease, in the Multi-Ethnic Study of Atherosclerosis, and identified a significant joint action of the CELSR2, SERPINA12, HPGD, and APOB genes. This finding provides new information to advance the limited knowledge about genetic regulation and gene interactions in metabolic pathways of LDL cholesterol. In conclusion, the proportional odds model-based GMDR is a useful tool that can boost statistical power and prediction accuracy in studying multifactor interactions underlying ordinal traits.
复杂性状的表现受到基因-基因和基因-环境相互作用的影响,识别多因素相互作用对于遗传学研究而言是一项重要但具有挑战性的任务。许多复杂的表型,如疾病严重程度,是按具有两个以上类别的有序尺度来衡量的。与将类别合并为两个相互排斥的组或将分析限制在类别对的逻辑回归模型相比,比例优势模型可以提高这些结果的统计效力。在本研究中,我们提出了一种基于比例优势模型的广义多因素降维(GMDR)方法,用于检测多分类有序表型背后的相互作用。计算机模拟表明,这种新的GMDR方法比基于逻辑回归模型和多项逻辑回归模型的GMDR方法具有更高的效力和更准确的预测能力。我们将这种新方法应用于动脉粥样硬化多民族研究中低密度脂蛋白(LDL)胆固醇(冠状动脉疾病的一个因果风险因素)的遗传分析,并确定了CELSR2、SERPINA12、HPGD和APOB基因的显著联合作用。这一发现为推进关于LDL胆固醇代谢途径中基因调控和基因相互作用的有限知识提供了新信息。总之,基于比例优势模型的GMDR是一种有用的工具,可提高研究有序性状背后多因素相互作用的统计效力和预测准确性。