Chen Zhongxue, Ng Hon Keung Tony, Li Jing, Liu Qingzhong, Huang Hanwen
1 Department of Epidemiology and Biostatistics, School of Public Health, Indiana University Bloomington, Bloomington, IN, USA.
2 Department of Statistical Science, Southern Methodist University, Dallas, TX, USA.
Stat Methods Med Res. 2017 Apr;26(2):567-582. doi: 10.1177/0962280214551815. Epub 2014 Sep 24.
In the past decade, hundreds of genome-wide association studies have been conducted to detect the significant single-nucleotide polymorphisms that are associated with certain diseases. However, most of the data from the X chromosome were not analyzed and only a few significant associated single-nucleotide polymorphisms from the X chromosome have been identified from genome-wide association studies. This is mainly due to the lack of powerful statistical tests. In this paper, we propose a novel statistical approach that combines the information of single-nucleotide polymorphisms on the X chromosome from both males and females in an efficient way. The proposed approach avoids the need of making strong assumptions about the underlying genetic models. Our proposed statistical test is a robust method that only makes the assumption that the risk allele is the same for both females and males if the single-nucleotide polymorphism is associated with the disease for both genders. Through simulation study and a real data application, we show that the proposed procedure is robust and have excellent performance compared to existing methods. We expect that many more associated single-nucleotide polymorphisms on the X chromosome will be identified if the proposed approach is applied to current available genome-wide association studies data.
在过去十年中,已经开展了数百项全基因组关联研究,以检测与某些疾病相关的显著单核苷酸多态性。然而,来自X染色体的大部分数据并未得到分析,并且从全基因组关联研究中仅鉴定出少数来自X染色体的显著相关单核苷酸多态性。这主要是由于缺乏强大的统计检验方法。在本文中,我们提出了一种新颖的统计方法,该方法以有效的方式结合了来自男性和女性X染色体上单核苷酸多态性的信息。所提出的方法避免了对潜在遗传模型做出强假设的必要性。我们提出的统计检验是一种稳健的方法,仅假设如果单核苷酸多态性与两性疾病均相关,则女性和男性的风险等位基因相同。通过模拟研究和实际数据应用,我们表明所提出的程序是稳健的,并且与现有方法相比具有优异的性能。我们预计,如果将所提出的方法应用于当前可用的全基因组关联研究数据,将会鉴定出更多来自X染色体的相关单核苷酸多态性。