Pingault Lise, Choulet Frédéric, Alberti Adriana, Glover Natasha, Wincker Patrick, Feuillet Catherine, Paux Etienne
Genome Biol. 2015 Feb 10;16(1):29. doi: 10.1186/s13059-015-0601-9.
Because of its size, allohexaploid nature, and high repeat content, the bread wheat genome is a good model for studying the impact of genome structure on gene organization, function, and regulation. However, the lack of a reference genome sequence has long hampered such studies, and our knowledge of the wheat gene space is still limited. Access to the reference sequence of wheat chromosome 3B provided us with an opportunity to study the wheat transcriptome and its relationships to genome and gene structure at a level that has never been reached before.
By combining this sequence with RNA-seq data, we construct a fine transcriptome map of chromosome 3B. More than 8,800 transcription sites are identified, distributed throughout the entire chromosome. Expression level, expression breadth, and alternative splicing, as well as several structural features of genes, including transcript length, number of exons, and cumulative intron length, are investigated. Our analysis reveals a non-monotonic relationship between gene expression and structure and leads to the hypothesis that gene structure is determined by its function, whereas gene expression is subject to energetic cost. Moreover, we observe a recombination-based partitioning at the gene structure and function level.
Our analysis provides new insights into the relationships between gene and genome structure and function. It reveals mechanisms conserved with other plant species as well as superimposed evolutionary forces that shaped the wheat gene space, likely participating in wheat adaptation.