Bariah Inbar, Gribun Liel, Kashkush Khalil
Department of Life Sciences, Ben-Gurion University, Beer-Sheva, Israel.
Front Plant Sci. 2023 Jan 12;13:1072232. doi: 10.3389/fpls.2022.1072232. eCollection 2022.
Recent studies in wheat emphasized the importance of TEs, which occupy ~85% of the wheat genome, as a major source of intraspecific genetic variation due to their recent activity and involvement in genomic rearrangements. The contribution of TEs to structural and functional variations in bread wheat genes is not fully understood.
Here, publicly available RNA-Seq databases of bread wheat were integrated to identify TE insertions within gene bodies (exons\ introns) and assess the impact of TE insertions on gene expression variations of homoeologs gene groups. Overall, 70,818 homoeologs genes were analyzed: 55,170 genes appeared in each one of the three subgenomes (termed ABD), named triads; 12,640 genes appeared in two of the three subgenomes (in A and B only, termed AB; or in A and D only, termed AD; or in B and D only, termed BD);, named dyads; and 3,008 genes underwent duplication in one of the three subgenomes (two copies in: subgenome A, termed AABD; subgenome B, termed ABBD; or subgenome D, termed ABDD), named tetrads.
To this end, we found that ~36% of the 70,818 genes contained at least one TE insertion within the gene body, mostly in triads. Analysis of 14,258 triads revealed that the presence of TE insertion in at least one of the triad genes (7,439 triads) was associated with balanced expression (similar expression levels) between the homoeolog genes. TE insertions within the exon or in the untranslated regions (UTRs) of one or more of the homoeologs in a triad were significantly associated with homoeolog expression bias. Furthermore, we found a statistically significant correlation between the presence\absence of TEs insertions belonging to six TE superfamilies and 17 TE subfamilies and the suppression of a single homoeolog gene. A significant association was observed between the presence of TE insertions from specific superfamilies and the expression of genes that are associated with biotic and abiotic stress responses.
Our data strongly indicate that TEs might play a prominent role in controlling gene expression in a genome-specific manner in bread wheat.
最近对小麦的研究强调了转座元件(TEs)的重要性,转座元件占据了小麦基因组约85%的比例,由于其近期的活性以及参与基因组重排,它是种内遗传变异的主要来源。转座元件对面包小麦基因结构和功能变异的贡献尚未完全明确。
在此,整合了公开可用的面包小麦RNA-Seq数据库,以识别基因体内(外显子\内含子)的转座元件插入,并评估转座元件插入对同源基因群组基因表达变异的影响。总共分析了70,818个同源基因:55,170个基因出现在三个亚基因组(称为ABD)中的每一个中,称为三联体;12,640个基因出现在三个亚基因组中的两个中(仅在A和B中,称为AB;或仅在A和D中,称为AD;或仅在B和D中,称为BD),称为二联体;3,008个基因在三个亚基因组之一中发生了复制(在亚基因组A中有两个拷贝,称为AABD;在亚基因组B中有两个拷贝,称为ABBD;或在亚基因组D中有两个拷贝,称为ABDD),称为四联体。
为此,我们发现70,818个基因中约36%在基因体内至少含有一个转座元件插入,大多数存在于三联体中。对14,258个三联体的分析表明,三联体基因中至少有一个存在转座元件插入(7,439个三联体)与同源基因之间的平衡表达(相似的表达水平)相关。三联体中一个或多个同源基因的外显子或非翻译区(UTR)内的转座元件插入与同源基因表达偏向显著相关。此外,我们发现属于六个转座元件超家族和17个转座元件亚家族的转座元件插入的有无与单个同源基因的抑制之间存在统计学上的显著相关性。观察到特定超家族的转座元件插入的存在与生物和非生物胁迫响应相关基因的表达之间存在显著关联。
我们的数据有力地表明,转座元件可能在面包小麦中以基因组特异性方式控制基因表达方面发挥重要作用。