Department of Biological Science and the Mervin Bovaird Institute for Molecular Biology and Biotechnology, The University of Tulsa, Tulsa, Oklahoma, United States of America.
PLoS One. 2011 Feb 10;6(2):e16931. doi: 10.1371/journal.pone.0016931.
Chloroplast-encoded genes (matK and rbcL) have been formally proposed for use in DNA barcoding efforts targeting embryophytes. Extending such a protocol to chlorophytan green algae, though, is fraught with problems including non homology (matK) and heterogeneity that prevents the creation of a universal PCR toolkit (rbcL). Some have advocated the use of the nuclear-encoded, internal transcribed spacer two (ITS2) as an alternative to the traditional chloroplast markers. However, the ITS2 is broadly perceived to be insufficiently conserved or to be confounded by introgression or biparental inheritance patterns, precluding its broad use in phylogenetic reconstruction or as a DNA barcode. A growing body of evidence has shown that simultaneous analysis of nucleotide data with secondary structure information can overcome at least some of the limitations of ITS2. The goal of this investigation was to assess the feasibility of an automated, sequence-structure approach for analysis of IT2 data from a large sampling of phylum Chlorophyta.
METHODOLOGY/PRINCIPAL FINDINGS: Sequences and secondary structures from 591 chlorophycean, 741 trebouxiophycean and 938 ulvophycean algae, all obtained from the ITS2 Database, were aligned using a sequence structure-specific scoring matrix. Phylogenetic relationships were reconstructed by Profile Neighbor-Joining coupled with a sequence structure-specific, general time reversible substitution model. Results from analyses of the ITS2 data were robust at multiple nodes and showed considerable congruence with results from published phylogenetic analyses.
CONCLUSIONS/SIGNIFICANCE: Our observations on the power of automated, sequence-structure analyses of ITS2 to reconstruct phylum-level phylogenies of the green algae validate this approach to assessing diversity for large sets of chlorophytan taxa. Moreover, our results indicate that objections to the use of ITS2 for DNA barcoding should be weighed against the utility of an automated, data analysis approach with demonstrated power to reconstruct evolutionary patterns for highly divergent lineages.
叶绿体编码基因(matK 和 rbcL)已被正式提议用于针对胚胎植物的 DNA 条形码研究。然而,将这样的方案扩展到绿藻门的绿藻中存在许多问题,包括非同源性(matK)和异质性,这使得创建通用 PCR 工具包(rbcL)变得困难。有人主张使用核编码的内部转录间隔区 2(ITS2)作为替代传统叶绿体标记物的方法。然而,ITS2 被广泛认为不够保守,或者受到杂交或双亲遗传模式的干扰,从而阻碍了其在系统发育重建或 DNA 条形码中的广泛应用。越来越多的证据表明,同时分析核苷酸数据和二级结构信息可以克服 ITS2 的至少一些局限性。本研究的目的是评估一种自动、序列-结构方法分析大量绿藻门 IT2 数据的可行性。
方法/主要发现:从 ITS2 数据库中获得的 591 种绿藻、741 种团藻和 938 种绿藻的序列和二级结构,均使用序列-结构特异性评分矩阵进行对齐。通过与序列-结构特异性的广义时间可逆取代模型相结合的轮廓邻居joining 重建系统发育关系。对 ITS2 数据进行分析的结果在多个节点上具有稳健性,并且与已发表的系统发育分析结果具有相当大的一致性。
结论/意义:我们对自动、序列-结构分析 ITS2 重建绿藻门系统发育的能力的观察结果验证了这种方法对评估大量绿藻类群多样性的有效性。此外,我们的结果表明,对于 DNA 条形码的使用,应该权衡对 ITS2 的反对意见与具有强大重建高度分化谱系进化模式能力的自动数据分析方法的实用性。