Department of Epidemiology and Biostatistics, School of Public Health, Shandong University, Jinan 250012, China.
BMC Genet. 2013 Sep 23;14:89. doi: 10.1186/1471-2156-14-89.
On thinking quantitatively of complex diseases, there are at least three statistical strategies for analyzing the gene-gene interaction: SNP by SNP interaction on single trait, gene-gene (each can involve multiple SNPs) interaction on single trait and gene-gene interaction on multiple traits. The third one is the most general in dissecting the genetic mechanism underlying complex diseases underpinning multiple quantitative traits. In this paper, we developed a novel statistic for this strategy through modifying the Partial Least Squares Path Modeling (PLSPM), called mPLSPM statistic.
Simulation studies indicated that mPLSPM statistic was powerful and outperformed the principal component analysis (PCA) based linear regression method. Application to real data in the EPIC-Norfolk GWAS sub-cohort showed suggestive interaction (γ) between TMEM18 gene and BDNF gene on two composite body shape scores (γ = 0.047 and γ = 0.058, with P = 0.021, P = 0.005), and BMI (γ = 0.043, P = 0.034). This suggested these scores (synthetically latent traits) were more suitable to capture the obesity related genetic interaction effect between genes compared to single trait.
The proposed novel mPLSPM statistic is a valid and powerful gene-based method for detecting gene-gene interaction on multiple quantitative phenotypes.
在对复杂疾病进行定量分析时,有至少三种分析基因-基因相互作用的统计策略:单性状的 SNP 间相互作用、单性状的基因-基因(每个基因可以涉及多个 SNP)相互作用和多性状的基因-基因相互作用。第三种策略是分析多数量性状复杂疾病遗传机制的最普遍方法。本文通过修改偏最小二乘路径建模(PLSPM),开发了一种用于该策略的新统计方法,称为 mPLSPM 统计量。
模拟研究表明,mPLSPM 统计量强大且优于基于主成分分析(PCA)的线性回归方法。在 EPIC-Norfolk GWAS 子队列的真实数据中的应用表明,TMEM18 基因和 BDNF 基因之间在两个复合体型评分(γ=0.047 和 γ=0.058,P=0.021,P=0.005)和 BMI(γ=0.043,P=0.034)上存在显著的相互作用(γ)。这表明与单性状相比,这些评分(综合潜在性状)更适合捕获基因之间与肥胖相关的遗传相互作用效应。
所提出的新型 mPLSPM 统计方法是一种有效的、强大的基于基因的方法,可用于检测多个数量性状上的基因-基因相互作用。