Professorship for Population Genetics, Department of Life Science Systems, Technical University of Munich, Munich, Germany.
Department of Environment and Biodiversity, Paris Lodron University of Salzburg, Salzburg, Austria.
Elife. 2024 Sep 12;12:RP89470. doi: 10.7554/eLife.89470.
With the availability of high-quality full genome polymorphism (SNPs) data, it becomes feasible to study the past demographic and selective history of populations in exquisite detail. However, such inferences still suffer from a lack of statistical resolution for recent, for example bottlenecks, events, and/or for populations with small nucleotide diversity. Additional heritable (epi)genetic markers, such as indels, transposable elements, microsatellites, or cytosine methylation, may provide further, yet untapped, information on the recent past population history. We extend the Sequential Markovian Coalescent (SMC) framework to jointly use SNPs and other hyper-mutable markers. We are able to (1) improve the accuracy of demographic inference in recent times, (2) uncover past demographic events hidden to SNP-based inference methods, and (3) infer the hyper-mutable marker mutation rates under a finite site model. As a proof of principle, we focus on demographic inference in using DNA methylation diversity data from 10 European natural accessions. We demonstrate that segregating single methylated polymorphisms (SMPs) satisfy the modeling assumptions of the SMC framework, while differentially methylated regions (DMRs) are not suitable as their length exceeds that of the genomic distance between two recombination events. Combining SNPs and SMPs while accounting for site- and region-level epimutation processes, we provide new estimates of the glacial age bottleneck and post-glacial population expansion of the European population. Our SMC framework readily accounts for a wide range of heritable genomic markers, thus paving the way for next-generation inference of evolutionary history by combining information from several genetic and epigenetic markers.
随着高质量全基因组多态性(SNP)数据的可用性,研究人口过去的人口和选择历史变得可行。然而,这种推断仍然缺乏对近期事件(例如瓶颈、瓶颈和/或核苷酸多样性较小的种群)的统计分辨率。其他可遗传(外)遗传标记,如插入缺失、转座元件、微卫星或胞嘧啶甲基化,可能会提供有关最近人口历史的进一步信息,而这些信息尚未开发。我们将顺序马尔可夫融合(SMC)框架扩展到联合使用 SNP 和其他超突变标记。我们能够(1)提高近期人口推断的准确性,(2)揭示 SNP 推断方法隐藏的过去人口事件,(3)在有限的位点模型下推断超突变标记的突变率。作为一个原理证明,我们专注于使用来自 10 个欧洲自然接入点的 DNA 甲基化多样性数据对进行人口推断。我们证明,分离的单甲基化多态性(SMP)满足 SMC 框架的建模假设,而差异甲基化区域(DMR)不适合作为它们的长度超过两个重组事件之间的基因组距离。在考虑位点和区域水平的外遗传突变过程的情况下,将 SNP 和 SMP 结合起来,我们为欧洲种群的冰河时代瓶颈和冰河时代后人口扩张提供了新的估计。我们的 SMC 框架很容易处理各种可遗传的基因组标记,从而为通过结合来自几种遗传和表观遗传标记的信息来推断下一代进化历史铺平了道路。