Stival Sena Juliana, Giguère Isabelle, Boyle Brian, Rigault Philippe, Birol Inanc, Zuccolo Andrea, Ritland Kermit, Ritland Carol, Bohlmann Joerg, Jones Steven, Bousquet Jean, Mackay John
Center for Forest Research and Institute for Systems and Integrative Biology, 1030 rue de la Médecine, Université Laval, Québec, QC G1V 0A6, Canada.
BMC Plant Biol. 2014 Apr 16;14:95. doi: 10.1186/1471-2229-14-95.
A positive relationship between genome size and intron length is observed across eukaryotes, including angiosperm plants, indicating a co-evolution of genome size and gene structure. Conifers have very large genomes and, on average, longer introns than most plants, but the impacts of their large genomes and longer introns on gene structure have not been described.
Gene structure was analyzed for 35 genes of Picea glauca obtained from BAC sequencing and genome assembly, including comparisons with Arabidopsis thaliana, Populus trichocarpa and Zea mays. We aimed to understand the impact of long introns on the structure of individual genes. The number and length of exons were well conserved among the species compared, but on average P. glauca introns were longer, and its genes had four times more intronic sequence than Arabidopsis and two times more than poplar and maize. However, pairwise comparisons of individual genes gave variable results, and not all contrasts were statistically significant. Genes generally accumulated one or a few longer introns in species with larger genomes, but the position of long introns varied between plant lineages. In P. glauca, highly expressed genes generally had more intronic sequence than tissue-preferential genes. Comparisons with Pinus taeda BACs and genome scaffolds showed high conservation of the position of long introns and of the sequence of short introns. A survey of 1836 P. glauca genes obtained by sequence capture, mostly containing introns <1 Kbp, showed that repeated sequences were 10× more abundant in introns than in exons.
For seed plants, conifers have large amounts of intronic sequence per gene, owing to the presence of a few long introns, and repetitive element sequences are ubiquitous in their introns. The results indicate a complex landscape of intron sizes and distribution across taxa and between genes with different expression profiles.