Alimpiev Egor, Rosenberg Noah A
Department of Biology, Stanford University, Stanford, CA 94305 USA.
Adv Appl Math. 2021 Oct;131. doi: 10.1016/j.aam.2021.102265. Epub 2021 Aug 23.
For a fixed set containing taxon labels, an ordered pair consisting of a gene tree topology and a species tree topology bijectively labeled with the labels of possesses a set of coalescent histories-mappings from the set of internal nodes of to the set of edges of describing possible lists of edges in on which the coalescences in take place. Enumerations of coalescent histories for gene trees and species trees have produced suggestive results regarding the pairs () that, for a fixed , have the largest number of coalescent histories. We define a class of 2-cherry binary tree topologies that we term , examining coalescent histories for non-matching pairs () in the case in which has a caterpillar shape and has a -pseudocaterpillar shape. Using a construction that associates coalescent histories for () with a class of "roadblocked" monotonic paths, we identify the -pseudocaterpillar labeled gene tree topology that, for a fixed caterpillar labeled species tree topology, gives rise to the largest number of coalescent histories. The shape that maximizes the number of coalescent histories places the "second" cherry of the -pseudocaterpillar equidistantly from the root of the "first" cherry and from the tree root. A symmetry in the numbers of coalescent histories for -pseudocaterpillar gene trees and caterpillar species trees is seen to exist around the maximizing value of the parameter . The results provide insight into the factors that influence the number of coalescent histories possible for a given gene tree and species tree.
对于一个包含分类单元标签的固定集合,由基因树拓扑结构和用集合中的标签进行双射标记的物种树拓扑结构组成的有序对,拥有一组合并历史——从基因树的内部节点集合到物种树的边集合的映射,描述了基因树中合并发生的物种树边的可能列表。基因树和物种树合并历史的枚举对于固定的集合,已经产生了关于具有最大数量合并历史的(基因树拓扑结构,物种树拓扑结构)对的启发性结果。我们定义了一类我们称为(\mathcal{T})的2-樱桃二叉树拓扑结构,研究在物种树具有毛毛虫形状且基因树具有(\mathcal{T})-伪毛毛虫形状的情况下不匹配对(基因树拓扑结构,物种树拓扑结构)的合并历史。通过一种将(基因树拓扑结构,物种树拓扑结构)的合并历史与一类“受阻”单调路径相关联的构造,我们确定了对于固定的毛毛虫标记物种树拓扑结构,产生最大数量合并历史的(\mathcal{T})-伪毛毛虫标记基因树拓扑结构。使合并历史数量最大化的形状将(\mathcal{T})-伪毛毛虫的“第二个”樱桃放置在距“第一个”樱桃的根和树根等距的位置。在参数的最大值附近,可以看到(\mathcal{T})-伪毛毛虫基因树和毛毛虫物种树的合并历史数量存在对称性。这些结果为影响给定基因树和物种树可能的合并历史数量的因素提供了见解。