İzmir International Biomedicine and Genome Institute, Dokuz Eylül University, İnciraltı, İzmir, Turkey.
Bioinformatics Platform, İzmir Biomedicine and Genome Center (IBG), İnciraltı, İzmir, Turkey.
Methods Mol Biol. 2023;2703:59-70. doi: 10.1007/978-1-0716-3389-2_5.
Transposable elements (TEs) are repeat elements that can relocate or create novel copies of themselves in the genome and contribute to genomic complexity and expansion, via events such as chromosome recombination or regulation of gene expression. However, given the large number of such repeats across the genome, identifying repeats of interest can be a challenge in even well-annotated genomes, especially in more complex, TE-rich plant genomes. Here, we describe a protocol for PlanTEnrichment, a database we created comprising information on 11 plant genomes to analyze stress-associated TEs using publicly available data. By selecting a genome and providing a list of genes or genomic regions whose TE associations the user wants to identify, the user can rapidly obtain TE subfamilies found near the provided regions, as well as their superfamily and class, and the enrichment values of the repeats. The results also provide the locations of individual repeat instances found, alongside the input regions or genes they are associated with, and a bar graph of the top ten most significant repeat subfamilies identified. PlanTEnrichment is freely available at http://tools.ibg.deu.edu.tr/plantenrichment/ and can be used by researchers with rudimentary or no proficiency in computational analysis of TE elements, allowing for expedience in the identification of TEs of interest and helping further our understanding of the potential contributions of TEs in plant genomes.
转座元件 (TEs) 是重复元件,可在基因组中重新定位或自行复制新的拷贝,通过染色体重组或基因表达调控等事件,导致基因组的复杂性和扩张。然而,由于基因组中存在大量此类重复序列,即使在注释良好的基因组中,识别感兴趣的重复序列也是一项挑战,尤其是在更复杂、富含 TEs 的植物基因组中。在这里,我们描述了一种名为 PlanTEnrichment 的数据库创建协议,该数据库包含 11 种植物基因组的信息,可使用公开数据分析与应激相关的 TEs。用户可以通过选择一个基因组并提供一个希望识别其 TE 关联的基因或基因组区域列表,快速获得提供区域附近发现的 TE 亚家族,以及它们的超家族和类别,以及重复序列的富集值。结果还提供了发现的单个重复实例的位置,以及它们与输入区域或基因的关联,并提供了识别出的前十个最重要的重复亚家族的条形图。PlanTEnrichment 可在 http://tools.ibg.deu.edu.tr/plantenrichment/ 免费获得,即使是对 TE 元素的计算分析没有基本或没有专业知识的研究人员也可以使用,这有助于快速识别感兴趣的 TE,并帮助我们进一步了解 TE 在植物基因组中的潜在贡献。