Guisinger Mary M, Kuehl Jennifer V, Boore Jeffrey L, Jansen Robert K
Section of Integrative Biology and Institute of Cellular and Molecular Biology, University of Texas, Austin TX 78712, USA.
Proc Natl Acad Sci U S A. 2008 Nov 25;105(47):18424-9. doi: 10.1073/pnas.0806759105. Epub 2008 Nov 14.
Angiosperm plastid genomes are generally conserved in gene content and order with rates of nucleotide substitutions for protein-coding genes lower than for nuclear protein-coding genes. A few groups have experienced genomic change, and extreme changes in gene content and order are found within the flowering plant family Geraniaceae. The complete plastid genome sequence of Pelargonium X hortorum (Geraniaceae) reveals the largest and most rearranged plastid genome identified to date. Highly elevated rates of sequence evolution in Geraniaceae mitochondrial genomes have been reported, but rates in Geraniaceae plastid genomes have not been characterized. Analysis of nucleotide substitution rates for 72 plastid genes for 47 angiosperm taxa, including nine Geraniaceae, show that values of dN are highly accelerated in ribosomal protein and RNA polymerase genes throughout the family. Furthermore, dN/dS is significantly elevated in the same two classes of plastid genes as well as in ATPase genes. A relatively high dN/dS ratio could be interpreted as evidence of two phenomena, namely positive or relaxed selection, neither of which is consistent with our current understanding of plastid genome evolution in photosynthetic plants. These analyses are the first to use protein-coding sequences from complete plastid genomes to characterize rates and patterns of sequence evolution for a broad sampling of photosynthetic angiosperms, and they reveal unprecedented accumulation of nucleotide substitutions in Geraniaceae. To explain these remarkable substitution patterns in the highly rearranged Geraniaceae plastid genomes, we propose a model of aberrant DNA repair coupled with altered gene expression.
被子植物的质体基因组在基因内容和顺序上通常是保守的,蛋白质编码基因的核苷酸替换率低于核蛋白编码基因。少数类群经历了基因组变化,在开花植物牻牛儿苗科中发现了基因内容和顺序的极端变化。天竺葵(牻牛儿苗科)的完整质体基因组序列揭示了迄今为止所鉴定出的最大且重排最多的质体基因组。据报道,牻牛儿苗科线粒体基因组的序列进化速率极高,但牻牛儿苗科质体基因组的速率尚未得到表征。对包括9种牻牛儿苗科植物在内的47个被子植物类群的72个质体基因的核苷酸替换率进行分析,结果表明,整个科的核糖体蛋白和RNA聚合酶基因中的非同义替换值(dN)显著加快。此外,在同样两类质体基因以及ATP酶基因中,dN/dS也显著升高。相对较高的dN/dS比值可以解释为两种现象的证据,即正选择或宽松选择,而这两种情况都与我们目前对光合植物质体基因组进化的理解不一致。这些分析首次使用完整质体基因组中的蛋白质编码序列来表征广泛采样的光合被子植物的序列进化速率和模式,并且揭示了牻牛儿苗科中前所未有的核苷酸替换积累。为了解释高度重排的牻牛儿苗科质体基因组中这些显著的替换模式,我们提出了一个异常DNA修复与基因表达改变相结合的模型。