Informatics Group, Harvard University, Cambridge, MA 02138, USA.
Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA.
Integr Comp Biol. 2024 Nov 21;64(5):1513-1525. doi: 10.1093/icb/icae056.
Comparative genomics provides ample ways to study genome evolution and its relationship to phenotypic traits. By developing and testing alternate models of evolution throughout a phylogeny, one can estimate rates of molecular evolution along different lineages in a phylogeny and link these rates with observations in extant species, such as convergent phenotypes. Pipelines for such work can help identify when and where genomic changes may be associated with, or possibly influence, phenotypic traits. We recently developed a set of models called PhyloAcc, using a Bayesian framework to estimate rates of nucleotide substitution on different branches of a phylogenetic tree and evaluate their association with pre-defined or estimated phenotypic traits. PhyloAcc-ST and PhyloAcc-GT both allow users to define a priori a set of target lineages and then compare different models to identify loci accelerating in one or more target lineages. Whereas ST considers only one species tree across all input loci, GT considers alternate topologies for every locus. PhyloAcc-C simultaneously models molecular rates and rates of continuous trait evolution, allowing the user to ask whether the two are associated. Here, we describe these models and provide tips and workflows on how to prepare the input data and run PhyloAcc.
比较基因组学为研究基因组进化及其与表型特征的关系提供了丰富的方法。通过在系统发育中开发和测试进化的替代模型,可以估计系统发育中不同谱系的分子进化率,并将这些速率与现存物种的观察结果(如趋同表型)联系起来。此类工作的管道可以帮助确定何时以及何处的基因组变化可能与表型特征相关,或者可能影响表型特征。我们最近开发了一组称为 PhyloAcc 的模型,使用贝叶斯框架来估计系统发育树上不同分支的核苷酸替换率,并评估它们与预定义或估计的表型特征的关联。PhyloAcc-ST 和 PhyloAcc-GT 都允许用户先验定义一组目标谱系,然后比较不同的模型来识别在一个或多个目标谱系中加速的基因座。而 ST 仅考虑所有输入基因座的一个物种树,GT 则为每个基因座考虑替代拓扑结构。PhyloAcc-C 同时对分子速率和连续性状进化速率进行建模,允许用户询问这两者是否相关。在这里,我们描述了这些模型,并提供了有关如何准备输入数据和运行 PhyloAcc 的提示和工作流程。