State Key Laboratory of Genetic Resources and Evolution/Key Laboratory of Healthy Aging Research of Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China.
CAS Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China.
Mol Biol Evol. 2019 Aug 1;36(8):1643-1652. doi: 10.1093/molbev/msz072.
A general south-north genetic divergence has been observed among Han Chinese in previous studies. However, these studies, especially those on mitochondrial DNA (mtDNA), are based either on partial mtDNA sequences or on limited samples. Given that Han Chinese comprise the world's largest population and reside throughout China, whether the south-north divergence can still be observed after all regional populations are considered remains unknown. Moreover, the factors involved in shaping the genetic landscape of Han Chinese need further investigation. In this study, we dissected the matrilineal landscape of Han Chinese by studying 4,004 mtDNA haplogroup-defining variants in 21,668 Han samples from virtually all provinces in China. Our results confirmed the genetic divergence between southern and northern Han populations. However, we also found a significant genetic divergence among populations from the three main river systems, that is, the Yangtze, the Yellow, and the Zhujiang (Pearl) rivers, which is largely attributed to the prevalent distribution of haplogroups D4, B4, and M7 in these river valleys. Further analyses based on 4,986 mitogenomes, including 218 newly generated sequences, indicated that this divergence was already established during the early Holocene and may have resulted from population expansion facilitated by ancient agriculture along these rivers. These results imply that the maternal gene pools of contemporary Han populations have retained the genetic imprint of early Neolithic farmers from different river basins, or that river valleys acted as relative migration barriers that facilitated genetic differentiation, thus highlighting the importance of the three ancient agricultures in shaping the genetic landscape of the Han Chinese.