Wang Yan, Zhao Lue Ping, Dudoit Sandrine
Division of Biostatistics, University of California, Berkeley, USA.
Am J Hum Genet. 2006 Apr;78(4):615-28. doi: 10.1086/502632. Epub 2006 Feb 13.
High-throughput genotyping technologies for SNPs have enabled the recent completion of the International HapMap Project (phase I), which has stimulated much interest in studying genomewide linkage-disequilibrium (LD) patterns. Conventional LD measures, such as D' and r(2), are two-point measurements, and their relationship with physical distance is highly noisy. We propose a new LD measure, Delta , defined in terms of the correlation coefficient for shared haplotype lengths around two loci, thereby borrowing information from multiple loci. A U-statistic-based estimator of Delta , which takes into consideration the dependence structure of the observed data, is developed and compared with an estimator based on the usual empirical correlation coefficient. Furthermore, we propose methods for inferring LD-decay rates and recombination hotspots on the basis of Delta . The results from coalescent-simulation studies and analysis of HapMap SNP data demonstrate that the proposed estimators of Delta are superior to the two most popular conventional LD measures, in terms of their close relationship with physical distance and recombination rate, their small variability, and their strong robustness to marker-allele frequencies. These merits may offer new opportunities for mapping complex disease genes and for investigating recombination mechanisms on the basis of better-quantified LD.
单核苷酸多态性(SNP)的高通量基因分型技术促成了国际人类基因组单体型图计划(第一阶段)近期的完成,该计划激发了人们对全基因组连锁不平衡(LD)模式研究的浓厚兴趣。传统的LD测量方法,如D'和r(2),是两点测量,它们与物理距离的关系噪声很大。我们提出了一种新的LD测量方法Delta,它是根据两个位点周围共享单倍型长度的相关系数来定义的,从而从多个位点借用信息。我们开发了一种基于U统计量的Delta估计器,该估计器考虑了观测数据的依赖结构,并将其与基于通常经验相关系数的估计器进行了比较。此外,我们还提出了基于Delta推断LD衰减率和重组热点的方法。合并模拟研究和人类基因组单体型图计划SNP数据分析的结果表明,就与物理距离和重组率的密切关系、小变异性以及对标记等位基因频率的强稳健性而言,所提出的Delta估计器优于两种最常用的传统LD测量方法。这些优点可能为定位复杂疾病基因以及基于更好量化的LD研究重组机制提供新的机会。