Cheng Xiaoheng, Steinrücken Matthias
Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA.
Department of Human Genetics, University of Chicago, Chicago, IL, USA.
bioRxiv. 2024 Dec 1:2023.10.12.562101. doi: 10.1101/2023.10.12.562101.
Whole-genome time-series allele frequency data are becoming more prevalent as ancient DNA (aDNA) sequences and data from evolve-and-resequence (E&R) experiments are generated at a rapid pace. Such data present unprecedented opportunities to elucidate the dynamics of genetic variation under selection. However, although the literature offers many methods for inferring the parameters of selection models from allele frequency trajectories, few provide user-friendly implementations suited to large-scale empirical applications. Here, we present diplo-locus, an open-source Python package for simulating time-series data, and performing inference from such data, under the Wright-Fisher diffusion with general diploid selection. The package includes Python modules as well as command-line tools and is available at: https://github.com/steinrue/diplo_locus.
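As a rough illustration of the model class named in the abstract, the sketch below simulates a single-locus allele frequency trajectory under a discrete-generation Wright-Fisher model with general diploid selection, the discrete process that the Wright-Fisher diffusion approximates. It does not use the diplo-locus API and is not the authors' implementation. The function names (`next_freq`, `simulate_trajectory`) and the fitness parameterization (relative fitnesses 1, 1 + `s_het`, 1 + `s_hom` for the three genotypes) are assumptions made for this example only.

```python
import random


def next_freq(x, s_het, s_hom):
    """Expected frequency of the focal allele after one round of selection.

    Assumed (hypothetical) parameterization: genotypes carrying 0, 1, or 2
    copies of the focal allele have relative fitness 1, 1 + s_het, 1 + s_hom.
    """
    w_hom = 1.0 + s_hom
    w_het = 1.0 + s_het
    # Mean fitness under Hardy-Weinberg genotype proportions.
    mean_w = x * x * w_hom + 2.0 * x * (1.0 - x) * w_het + (1.0 - x) ** 2
    return (x * x * w_hom + x * (1.0 - x) * w_het) / mean_w


def simulate_trajectory(x0, s_het, s_hom, n_diploid, generations, seed=None):
    """Simulate allele frequencies for `generations` generations.

    Each generation applies selection deterministically, then draws
    2 * n_diploid gene copies binomially to model genetic drift.
    """
    rng = random.Random(seed)
    n_chrom = 2 * n_diploid
    x = x0
    trajectory = [x]
    for _ in range(generations):
        p = next_freq(x, s_het, s_hom)
        # Binomial draw written as a sum of Bernoulli trials (stdlib only).
        count = sum(rng.random() < p for _ in range(n_chrom))
        x = count / n_chrom
        trajectory.append(x)
    return trajectory


if __name__ == "__main__":
    # Illustrative values only: additive selection in a small population.
    traj = simulate_trajectory(0.1, s_het=0.05, s_hom=0.1,
                               n_diploid=200, generations=50, seed=1)
    print([round(f, 3) for f in traj[::10]])
```

Setting `s_het` and `s_hom` independently is what makes the selection "general diploid": additive, dominant, recessive, and over- or underdominant regimes are all special cases of the two coefficients.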