Laboratory for Medical Informatics, Center for Genomic Medicine, The Institute of Physical and Chemical Research (RIKEN), Yokohama, Japan.
J Hum Genet. 2012 May;57(5):326-34. doi: 10.1038/jhg.2012.26. Epub 2012 Mar 29.
Although the Japanese population has a rather low genetic diversity, we recently confirmed the presence of two main clusters (the Hondo and Ryukyu clusters) through principal component analysis of genome-wide single-nucleotide polymorphism (SNP) genotypes. Understanding the genetic differences between the two main clusters requires further genome-wide analyses based on a dense SNP set and comparison of haplotype frequencies. In the present study, we determined haplotypes for the Hondo cluster of the Japanese population by detecting SNP homozygotes with 388,591 autosomal SNPs from 18,379 individuals and estimated the haplotype frequencies. Haplotypes for the Ryukyu cluster were inferred by a statistical approach using the genotype data from 504 individuals. We then compared the haplotype frequencies between the Hondo and Ryukyu clusters. In most genomic regions, the haplotype frequencies in the Hondo and Ryukyu clusters were very similar. However, in addition to the human leukocyte antigen region on chromosome 6, other genomic regions (chromosomes 3, 4, 5, 7, 10 and 12) showed dissimilarities in haplotype frequency. These regions were enriched for genes involved in the immune system, cell-cell adhesion and the intracellular signaling cascade. These differentiated genomic regions between the Hondo and Ryukyu clusters are of interest because they (1) should be examined carefully in association studies and (2) likely contain genes responsible for morphological or physiological differences between the two groups.
尽管日本人口的遗传多样性较低,但我们最近通过全基因组单核苷酸多态性 (SNP) 基因型的主成分分析证实了两个主要聚类(本州和琉球聚类)的存在。了解两个主要聚类之间的遗传差异需要进一步进行基于密集 SNP 集的全基因组分析,并比较单倍型频率。在本研究中,我们通过检测来自 18379 个人的 388591 个常染色体 SNP 的 SNP 纯合子来确定日本人群本州聚类的单倍型,并估计单倍型频率。通过使用 504 个人的基因型数据,通过统计方法推断琉球聚类的单倍型。然后,我们比较了本州和琉球聚类之间的单倍型频率。在大多数基因组区域,本州和琉球聚类的单倍型频率非常相似。然而,除了 6 号染色体上的人类白细胞抗原区域外,其他基因组区域(3、4、5、7、10 和 12 号染色体)的单倍型频率存在差异。这些区域富含参与免疫系统、细胞-细胞黏附和细胞内信号级联的基因。本州和琉球聚类之间这些分化的基因组区域很有趣,因为它们 (1) 应该在关联研究中仔细检查,并且 (2) 可能包含负责两组之间形态或生理差异的基因。