Boes Olivier, Fischer Mareike, Kelk Steven
IEEE/ACM Trans Comput Biol Bioinform. 2017 Mar-Apr;14(2):472-477. doi: 10.1109/TCBB.2016.2543727. Epub 2016 Mar 17.
Given two phylogenetic trees on the same set of taxa X , the maximum parsimony distance d is defined as the maximum, ranging over all characters χ on X, of the absolute difference in parsimony score induced by χ on the two trees. In this note, we prove that for binary trees there exists a character achieving this maximum that is convex on one of the trees (i.e., the parsimony score induced on that tree is equal to the number of states in the character minus 1) and such that the number of states in the character is at most 7d-5 . This is the first non-trivial bound on the number of states required by optimal characters, convex or otherwise. The result potentially has algorithmic significance because, unlike general characters, convex characters with a bounded number of states can be enumerated in polynomial time.
给定同一分类单元集(X)上的两棵系统发育树,最大简约距离(d)定义为:在(X)上的所有字符(\chi)中,(\chi)在两棵树上诱导的简约得分的绝对差值的最大值。在本笔记中,我们证明对于二叉树,存在一个达到此最大值的字符,该字符在其中一棵树中是凸的(即,在该树上诱导的简约得分等于字符中的状态数减(1)),并且该字符中的状态数至多为(7d - 5)。这是关于最优字符(无论是否为凸字符)所需状态数的首个非平凡界限。该结果可能具有算法意义,因为与一般字符不同,具有有界状态数的凸字符可以在多项式时间内枚举。