Xu Hongyan, George Varghese
Department of Biostatistics, Georgia Health Sciences University, Augusta,GA, USA.
BMC Res Notes. 2011 Apr 14;4:124. doi: 10.1186/1756-0500-4-124.
Genetic association studies, especially genome-wide studies, make use of linkage disequilibrium(LD) information between single nucleotide polymorphisms (SNPs). LD is also used for studying genome structure and has been valuable for evolutionary studies. The strength of LD is commonly measured by r2, a statistic closely related to the Pearson's χ2 statistic. However, the computation and testing of linkage disequilibrium using r2 requires known haplotype counts of the SNP pair, which can be a problem for most population-based studies where the haplotype phase is unknown. Most statistical genetic packages use likelihood-based methods to infer haplotypes. However, the variability of haplotype estimation needs to be accounted for in the test for linkage disequilibrium.
We develop a Monte Carlo based test for LD based on the null distribution of the r2 statistic. Our test is based on r2 and can be reported together with r2. Simulation studies show that it offers slightly better power than existing methods.
Our approach provides an alternative test for LD and has been implemented as a R program for ease of use. It also provides a general framework to account for other haplotype inference methods in LD testing.
基因关联研究,尤其是全基因组研究,会利用单核苷酸多态性(SNP)之间的连锁不平衡(LD)信息。LD也被用于研究基因组结构,并且在进化研究中具有重要价值。LD的强度通常用r2来衡量,r2是一个与皮尔逊卡方统计量密切相关的统计量。然而,使用r2计算和检验连锁不平衡需要已知SNP对的单倍型计数,这对于大多数群体研究来说可能是个问题,因为在这些研究中,单倍型相位是未知的。大多数统计遗传软件包使用基于似然性的方法来推断单倍型。然而,在连锁不平衡检验中需要考虑单倍型估计的变异性。
我们基于r2统计量的零分布开发了一种基于蒙特卡洛的LD检验方法。我们的检验基于r2,可以与r2一起报告。模拟研究表明,它比现有方法具有略高的检验效能。
我们的方法为LD提供了一种替代检验方法,并已实现为一个R程序以便于使用。它还为在LD检验中考虑其他单倍型推断方法提供了一个通用框架。