Boyce Thompson Institute for Plant Research, Tower Road, Ithaca, NY 14853-1801, USA.
BMC Genomics. 2012 Aug 17;13:406. doi: 10.1186/1471-2164-13-406.
Polyploidization is an important mechanism in plant evolution. By analyzing the leaf transcriptomes taken from the allotetraploid Nicotiana tabacum (tobacco) and parental genome donors, N. sylvesteris (S-Genome) and N. tomentosiformis (T-Genome), a phylogenomic approach was taken to map the fate of homeologous gene pairs in this plant.
A comparison between the genes present in the leaf transcriptomes of N. tabacum and modern day representatives of its progenitor species demonstrated that only 33% of assembled transcripts could be distinguished based on their sequences. A large majority of the genes (83.6% of the non parent distinguishable and 87.2% of the phylogenetic topology analyzed clusters) expressed above background level (more than 5 reads) showed similar overall expression levels. Homeologous sequences could be identified for 968 gene clusters, and 90% (6% of all genes) of the set maintained expression of only one of the tobacco homeologs. When both homeologs were expressed, only 15% (0.5% of the total) showed evidence of differential expression, providing limited evidence of subfunctionalization. Comparing the rate of synonymous nucleotide substitution (Ks) and non-synonymous nucleotide substitution (Kn) provided limited evidence for positive selection during the evolution of tobacco since the polyploidization event took place.
Polyploidization is a powerful mechanism for plant speciation that can occur during one generation; however millions of generations may be necessary for duplicate genes to acquire a new function. Analysis of the tobacco leaf transcriptome reveals that polyploidization, even in a young tetraploid such as tobacco, can lead to complex changes in gene expression. Gene loss and gene silencing, or subfunctionalization may explain why both homeologs are not expressed by the associated genes. With Whole Genome Duplication (WGD) events, polyploid genomes usually maintain a high percentage of gene duplicates. The data provided little evidence of preferential maintenance of gene expression from either the T- or S-genome. Additionally there was little evidence of neofunctionalization in Nicotiana tabacum suggesting it occurs at a low frequency in young polyploidy.
多倍化是植物进化中的一个重要机制。通过分析来自异源四倍体烟草(烟草)及其亲本基因组供体 N. sylvesteris(S 基因组)和 N. tomentosiformis(T 基因组)的叶片转录组,采用系统基因组学方法来映射该植物中同源基因对的命运。
将烟草叶片转录组中的基因与现代其祖先物种的代表进行比较,结果表明,只有 33%的组装转录本可以根据其序列区分开来。大多数基因(83.6%的不可区分亲本和 87.2%分析的系统发育拓扑聚类)表达水平高于背景水平(超过 5 个读数),表现出相似的整体表达水平。可以鉴定出 968 个基因簇的同源序列,并且该组的 90%(所有基因的 6%)仅表达烟草同源物中的一个。当两个同源物都表达时,只有 15%(总共有 0.5%)表现出差异表达的证据,这提供了亚功能化的有限证据。比较同义核苷酸替换(Ks)和非同义核苷酸替换(Kn)的速率为烟草多倍化事件发生后提供了有限的正选择进化证据。
多倍化是植物物种形成的强大机制,可以在一代中发生;然而,可能需要数百万代才能使重复基因获得新功能。烟草叶片转录组的分析表明,多倍化甚至在像烟草这样的年轻四倍体中,也会导致基因表达的复杂变化。基因丢失和基因沉默,或亚功能化可能解释了为什么相关基因没有表达两个同源物。随着全基因组加倍(WGD)事件,多倍体基因组通常会保留大量的基因副本。数据几乎没有提供证据表明 T-或 S-基因组优先维持基因表达。此外,在烟草中几乎没有证据表明新功能化,这表明它在年轻的多倍体中发生的频率较低。