Key Laboratory for Applied Statistics of MOE, School of Mathematics and Statistics, Northeast Normal University, Changchun, 130024, Jilin, China.
Department of Mathematics, College of Science, Yanbian University, Yanji, 133002, Jilin, China.
BMC Bioinformatics. 2024 Apr 4;25(1):144. doi: 10.1186/s12859-024-05731-8.
Joint analysis of multiple phenotypes in studies of biological systems such as Genome-Wide Association Studies is critical to revealing the functional interactions between various traits and genetic variants, but growth of data in dimensionality has become a very challenging problem in the widespread use of joint analysis. To handle the excessiveness of variables, we consider the sliced inverse regression (SIR) method. Specifically, we propose a novel SIR-based association test that is robust and powerful in testing the association between multiple predictors and multiple outcomes.
We conduct simulation studies in both low- and high-dimensional settings with various numbers of Single-Nucleotide Polymorphisms and consider the correlation structure of traits. Simulation results show that the proposed method outperforms the existing methods. We also successfully apply our method to the genetic association study of ADNI dataset. Both the simulation studies and real data analysis show that the SIR-based association test is valid and achieves a higher efficiency compared with its competitors.
Several scenarios with low- and high-dimensional responses and genotypes are considered in this paper. Our SIR-based method controls the estimated type I error at the pre-specified level .
在全基因组关联研究等生物系统的研究中,联合分析多个表型对于揭示各种特征和遗传变异之间的功能相互作用至关重要,但数据维度的增长已成为联合分析广泛应用中的一个非常具有挑战性的问题。为了处理变量的过多问题,我们考虑切片逆回归(SIR)方法。具体来说,我们提出了一种新的基于 SIR 的关联测试方法,该方法在测试多个预测因子和多个结果之间的关联时具有稳健性和强大性。
我们在具有不同数量单核苷酸多态性的低维和高维设置中进行了模拟研究,并考虑了性状的相关结构。模拟结果表明,所提出的方法优于现有方法。我们还成功地将我们的方法应用于 ADNI 数据集的遗传关联研究。模拟研究和实际数据分析均表明,基于 SIR 的关联测试是有效的,与竞争对手相比,它具有更高的效率。
本文考虑了低维和高维响应和基因型的几种情况。我们的基于 SIR 的方法可以控制预指定水平的估计Ⅰ型错误率。