Donoghue M J, Mathews S
Department of Organismic and Evolutionary Biology, Harvard University Herbaria, 22 Divinity Avenue, Cambridge, Massachusetts, 02138, USA.
Mol Phylogenet Evol. 1998 Jun;9(3):489-500. doi: 10.1006/mpev.1998.0511.
The root of the angiosperm tree has not yet been established. Major morphological and molecular differences between angiosperms and other seed plants have introduced ambiguities and possibly spurious results. Because it is unlikely that extant species more closely related to angiosperms will be discovered, and because relevant fossils will almost certainly not yield molecular data, the use of duplicate genes for rooting purposes may provide the best hope of a solution. Simultaneous analysis of the genes resulting from a gene duplication event along the branch subtending angiosperms would yield an unrooted network, wherein two congruent gene trees should be connected by a single branch. In these circumstances the best rooted species tree is the one that corresponds to the two gene trees when the network is rooted along the connecting branch. In general, this approach can be viewed as choosing among rooted species trees by minimizing hypothesized events such as gene duplication, gene loss, lineage sorting, and lateral transfer. Of those gene families that are potentially relevant to the angiosperm problem, phytochrome genes warrant special attention. Phylogenetic analysis of a sample of complete phytochrome (PHY) sequences implies that an initial duplication event preceded (or occurred early within) the radiation of seed plants and that each of the two resulting copies duplicated again. In one of these cases, leading to the PHYA and PHYC lineages, duplication appears to have occurred before the diversification of angiosperms. Duplicate gene trees are congruent in these broad analyses, but the sample of sequences is too limited to provide much insight into the rooting question. Preliminary analyses of partial PHYA and PHYC sequences from several presumably basal angiosperm lineages are promising, but more data are needed to critically evaluate the power of these genes to resolve the angiosperm radiation.
被子植物谱系树的根部尚未确定。被子植物与其他种子植物之间在形态和分子方面存在重大差异,这带来了模糊性,并可能产生虚假结果。由于不太可能发现与被子植物关系更密切的现存物种,而且相关化石几乎肯定无法提供分子数据,因此使用重复基因来确定根部可能是找到解决方案的最大希望。沿着被子植物的分支对基因复制事件产生的基因进行同步分析,将得到一个无根网络,其中两个一致的基因树应由单个分支连接。在这种情况下,最佳的有根物种树是当网络沿着连接分支生根时与两个基因树相对应的那个。一般来说,这种方法可以被视为通过最小化诸如基因复制、基因丢失、谱系分选和横向转移等假设事件,在有根物种树中进行选择。在那些可能与被子植物问题相关的基因家族中,光敏色素基因值得特别关注。对完整光敏色素(PHY)序列样本进行的系统发育分析表明,一次初始复制事件发生在种子植物辐射之前(或在其早期发生),并且产生的两个副本中的每一个又再次复制。在其中一种情况下,导致了PHYA和PHYC谱系,复制似乎发生在被子植物多样化之前。在这些广泛的分析中,重复基因树是一致的,但序列样本过于有限,无法为解决根部问题提供太多见解。对几个可能是基部被子植物谱系的部分PHYA和PHYC序列进行的初步分析很有前景,但需要更多数据来严格评估这些基因解决被子植物辐射问题的能力。