Sawyer Sarah L, Mukherjee Namita, Pakstis Andrew J, Feuk Lars, Kidd Judith R, Brookes Anthony J, Kidd Kenneth K
Center for Genomics and Bioinformatics, Karolinska Institute, Berzelius väg 35, Stockholm 17177, Sweden.
Eur J Hum Genet. 2005 May;13(5):677-86. doi: 10.1038/sj.ejhg.5201368.
A major initiative to create a global human haplotype map has recently been launched as a tool to improve the efficiency of disease gene mapping. The 'HapMap' project will study common variants in depth in four populations (and to a lesser degree in up to 12) to catalogue haplotypes that are expected to be common to all populations. A hope of the 'HapMap' project is that much of the genome occurs in regions of limited diversity, such that only a few of the SNPs in each region will capture the diversity and be relevant around the world. To explore the implications of studying only a limited number of populations, we have analyzed linkage disequilibrium (LD) patterns of three 175-320 kb genomic regions in 16 diverse populations, with an emphasis on African and European populations. Analyses of these three genomic regions provide an empirical demonstration of marked differences in frequencies of the same few haplotypes, resulting in differences in the amount of LD and very different sets of haplotype frequencies. These results highlight the distinction between the statistical concept of LD and the biological reality of haplotypes and their frequencies. The significant quantitative and qualitative variation in LD among populations, even for populations within a geographic region, emphasizes the importance of studying diverse populations in the HapMap project to assure broad applicability of the results.
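The distinction drawn above between LD as a statistic and haplotypes as the underlying biological entities can be illustrated with a minimal sketch. It assumes the simplest case of two biallelic SNPs and uses the standard pairwise measures D, D' and r². The abstract does not say which LD measures the authors computed, the function name `ld_stats` is hypothetical, and the two sets of haplotype frequencies are invented for illustration; they are not data from the study.

```python
def ld_stats(freqs):
    """Return (D, D', r^2) for two biallelic SNPs.

    `freqs` maps the two-SNP haplotypes "AB", "Ab", "aB", "ab" to their
    population frequencies, which are assumed to sum to 1.
    """
    p_ab = freqs.get("AB", 0.0)
    p_a = p_ab + freqs.get("Ab", 0.0)  # frequency of allele A at SNP 1
    p_b = p_ab + freqs.get("aB", 0.0)  # frequency of allele B at SNP 2

    # D: departure of the AB haplotype frequency from independence.
    d = p_ab - p_a * p_b

    # D' rescales |D| by its maximum possible value given the allele frequencies.
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = abs(d) / d_max if d_max > 0 else 0.0

    # r^2: squared correlation between the alleles at the two SNPs.
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    r2 = d * d / denom if denom > 0 else 0.0
    return d, d_prime, r2


# Hypothetical populations carrying the same four haplotypes
# at different frequencies (illustrative values only).
pop_x = {"AB": 0.45, "Ab": 0.05, "aB": 0.05, "ab": 0.45}
pop_y = {"AB": 0.30, "Ab": 0.30, "aB": 0.20, "ab": 0.20}

for name, pop in (("X", pop_x), ("Y", pop_y)):
    d, d_prime, r2 = ld_stats(pop)
    print(f"population {name}: D={d:.3f}  D'={d_prime:.3f}  r2={r2:.3f}")
```

In this invented example both populations carry the same haplotypes, yet the first shows strong LD (D' of about 0.8, r² of about 0.64) while the second shows essentially none, because its haplotype frequencies equal the products of its allele frequencies. The LD statistic therefore summarizes the frequency distribution and does not by itself identify which haplotypes are present.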