Blazier John C, Jansen Robert K, Mower Jeffrey P, Govindu Madhu, Zhang Jin, Weng Mao-Lun, Ruhlman Tracey A
Department of Integrative Biology, University of Texas at Austin, Austin, TX, USA.
Department of Integrative Biology, University of Texas at Austin, Austin, TX, USA Department of Biological Science, Faculty of Science, King Abdulaziz University, Jeddah 21589, Saudi Arabia.
Ann Bot. 2016 Jun;117(7):1209-20. doi: 10.1093/aob/mcw065. Epub 2016 Apr 28.
Several unrelated lineages such as plastids, viruses and plasmids, have converged on quadripartite genomes of similar size with large and small single copy regions and a large inverted repeat (IR). Except for Erodium (Geraniaceae), saguaro cactus and some legumes, the plastomes of all photosynthetic angiosperms display this structure. The functional significance of the IR is not understood and Erodium provides a system to examine the role of the IR in the long-term stability of these genomes. We compared the degree of genomic rearrangement in plastomes of Erodium that differ in the presence and absence of the IR.
We sequenced 17 new Erodium plastomes. Using 454, Illumina, PacBio and Sanger sequences, 16 genomes were assembled and categorized along with one incomplete and two previously published Erodium plastomes. We conducted phylogenetic analyses among these species using a dataset of 19 protein-coding genes and determined if significantly higher evolutionary rates had caused the long branch seen previously in phylogenetic reconstructions within the genus. Bioinformatic comparisons were also performed to evaluate plastome evolution across the genus.
Erodium plastomes fell into four types (Type 1-4) that differ in their substitution rates, short dispersed repeat content and degree of genomic rearrangement, gene and intron content and GC content. Type 4 plastomes had significantly higher rates of synonymous substitutions (dS) for all genes and for 14 of the 19 genes non-synonymous substitutions (dN) were significantly accelerated. We evaluated the evidence for a single IR loss in Erodium and in doing so discovered that Type 4 plastomes contain a novel IR.
The presence or absence of the IR does not affect plastome stability in Erodium. Rather, the overall repeat content shows a negative correlation with genome stability, a pattern in agreement with other angiosperm groups and recent findings on genome stability in bacterial endosymbionts.
质体、病毒和质粒等几个不相关的谱系已经汇聚到大小相似的四分体基因组,这些基因组具有大单拷贝区域、小单拷贝区域和一个大的反向重复序列(IR)。除了牻牛儿苗属(牻牛儿苗科)、树形仙人掌和一些豆科植物外,所有光合被子植物的质体基因组都呈现这种结构。IR的功能意义尚不清楚,牻牛儿苗属提供了一个系统来研究IR在这些基因组长期稳定性中的作用。我们比较了牻牛儿苗属质体基因组中存在和不存在IR时的基因组重排程度。
我们对17个新的牻牛儿苗属质体基因组进行了测序。利用454测序、Illumina测序、PacBio测序和桑格测序,组装并分类了16个基因组,以及一个不完整的和两个先前发表的牻牛儿苗属质体基因组。我们使用一个包含19个蛋白质编码基因的数据集对这些物种进行了系统发育分析,并确定是否是显著更高的进化速率导致了该属先前在系统发育重建中出现的长枝。还进行了生物信息学比较,以评估整个属的质体基因组进化。
牻牛儿苗属质体基因组分为四种类型(1 - 4型),它们在替换率、短分散重复序列含量、基因组重排程度、基因和内含子含量以及GC含量方面存在差异。4型质体基因组的所有基因同义替换率(dS)显著更高,并且19个基因中的14个基因非同义替换率(dN)显著加快。我们评估了牻牛儿苗属中单个IR缺失的证据,在此过程中发现4型质体基因组包含一个新的IR。
IR的存在与否并不影响牻牛儿苗属质体基因组的稳定性。相反,总体重复序列含量与基因组稳定性呈负相关,这一模式与其他被子植物类群以及近期关于细菌内共生体基因组稳定性的研究结果一致。