Yu Zhaoxia, Schaid Daniel J
Division of Biostatistics, Department of Health Sciences Research, Mayo Clinic College of Medicine, Rochester, Minnesota 55905, USA.
Genet Epidemiol. 2007 Sep;31(6):553-64. doi: 10.1002/gepi.20228.
Multi-locus association analyses, including haplotype-based analyses, can sometimes provide greater power than single-locus analyses for detecting disease susceptibility loci. This potential gain, however, can be compromised by the large number of degrees of freedom caused by irrelevant markers. Exhaustive search for the optimal set of markers might be possible for a small number of markers, yet it is computationally inefficient. In this paper, we present a sequential haplotype scan method to search for combinations of adjacent markers that are jointly associated with disease status. When evaluating each marker, we add markers close to it in a sequential manner: a marker is added if its contribution to the haplotype association with disease is warranted, conditional on current haplotypes. This conditional evaluation is based on the well-known Mantel-Haenszel statistic. We propose two permutation based methods to evaluate the growing haplotypes: a haplotype method for the combined markers, and a summary method that sums conditional statistics. We compared our proposed methods, the single-locus method, and a sliding window method using simulated data. We also applied our sequential haplotype scan algorithm to experimental data for CYP2D6. The results indicate that the sequential scan procedure can identify a set of adjacent markers whose haplotypes might have strong genetic effects or be in linkage disequilibrium with disease predisposing variants. As a result, our methods can achieve greater power than the single-locus method, yet is much more computationally efficient than sliding window methods.
多位点关联分析,包括基于单倍型的分析,在检测疾病易感基因座时,有时比单基因座分析具有更强的功效。然而,这种潜在的优势可能会因无关标记导致的大量自由度而受到影响。对于少数标记,详尽搜索最优标记集或许可行,但计算效率低下。在本文中,我们提出了一种顺序单倍型扫描方法,以搜索与疾病状态联合相关的相邻标记组合。在评估每个标记时,我们以顺序方式添加与其相邻的标记:如果根据当前单倍型,该标记对与疾病的单倍型关联有显著贡献,则将其添加。这种条件评估基于著名的Mantel-Haenszel统计量。我们提出了两种基于置换的方法来评估不断增长的单倍型:一种是针对组合标记的单倍型方法,另一种是汇总条件统计量的汇总方法。我们使用模拟数据比较了我们提出的方法、单基因座方法和滑动窗口方法。我们还将顺序单倍型扫描算法应用于CYP2D6的实验数据。结果表明,顺序扫描程序可以识别出一组相邻标记,其单倍型可能具有强大的遗传效应,或者与疾病易感变异处于连锁不平衡状态。因此,我们的方法比单基因座方法具有更强的功效,并且比滑动窗口方法具有更高的计算效率。