Division of Biostatistics and Bioinformatics, Institute of Population Health Sciences, National Health Research Institutes, Zhunan 35053, Taiwan.
Bioinformatics. 2017 Nov 15;33(22):3595-3602. doi: 10.1093/bioinformatics/btx409.
Identification of single nucleotide polymorphism (SNP) interactions is an important and challenging topic in genome-wide association studies (GWAS). Many approaches have been applied to detecting whole-genome interactions. However, these approaches to interaction analysis tend to miss causal interaction effects when the individual marginal effects are uncorrelated to trait, while their interaction effects are highly associated with the trait.
A grouped variable selection technique, called two-stage grouped sure independence screening (TS-GSIS), is developed to study interactions that may not have marginal effects. The proposed TS-GSIS is shown to be very helpful in identifying not only causal SNP effects that are uncorrelated to trait but also their corresponding SNP-SNP interaction effects. The benefit of TS-GSIS are gaining detection of interaction effects by taking the joint information among the SNPs and determining the size of candidate sets in the model. Simulation studies under various scenarios are performed to compare performance of TS-GSIS and current approaches. We also apply our approach to a real rheumatoid arthritis (RA) dataset. Both the simulation and real data studies show that the TS-GSIS performs very well in detecting SNP-SNP interactions.
R-package is delivered through CRAN and is available at: https://cran.r-project.org/web/packages/TSGSIS/index.html.
Supplementary data are available at Bioinformatics online.
单核苷酸多态性(SNP)相互作用的鉴定是全基因组关联研究(GWAS)中的一个重要且具有挑战性的课题。已经有许多方法被应用于检测全基因组相互作用。然而,当个体边际效应与性状不相关,而其相互作用效应与性状高度相关时,这些相互作用分析方法往往会错过因果相互作用效应。
提出了一种分组变量选择技术,称为两阶段分组确定性独立筛选(TS-GSIS),用于研究可能没有边际效应的相互作用。所提出的 TS-GSIS 被证明非常有助于识别不仅与性状不相关的因果 SNP 效应,而且还识别其相应的 SNP-SNP 相互作用效应。TS-GSIS 的优点是通过利用 SNP 之间的联合信息并确定模型中候选集的大小来获得检测相互作用效应的益处。在各种场景下进行了模拟研究,以比较 TS-GSIS 和当前方法的性能。我们还将我们的方法应用于真实的类风湿关节炎(RA)数据集。模拟和真实数据研究都表明,TS-GSIS 在检测 SNP-SNP 相互作用方面表现非常出色。
R 包通过 CRAN 提供,并可在以下网址获得:https://cran.r-project.org/web/packages/TSGSIS/index.html。
补充数据可在生物信息学在线获得。