Suppr超能文献

植物转座元件富集分析:一种在选定植物基因组中与目标区域相关的转座元件快速鉴定的操作指南。

PlanTEnrichment: A How-to Guide on Rapid Identification of Transposable Elements Associated with Regions of Interest in Select Plant Genomes.

机构信息

İzmir International Biomedicine and Genome Institute, Dokuz Eylül University, İnciraltı, İzmir, Turkey.

Bioinformatics Platform, İzmir Biomedicine and Genome Center (IBG), İnciraltı, İzmir, Turkey.

出版信息

Methods Mol Biol. 2023;2703:59-70. doi: 10.1007/978-1-0716-3389-2_5.

Abstract

Transposable elements (TEs) are repeat elements that can relocate or create novel copies of themselves in the genome and contribute to genomic complexity and expansion, via events such as chromosome recombination or regulation of gene expression. However, given the large number of such repeats across the genome, identifying repeats of interest can be a challenge in even well-annotated genomes, especially in more complex, TE-rich plant genomes. Here, we describe a protocol for PlanTEnrichment, a database we created comprising information on 11 plant genomes to analyze stress-associated TEs using publicly available data. By selecting a genome and providing a list of genes or genomic regions whose TE associations the user wants to identify, the user can rapidly obtain TE subfamilies found near the provided regions, as well as their superfamily and class, and the enrichment values of the repeats. The results also provide the locations of individual repeat instances found, alongside the input regions or genes they are associated with, and a bar graph of the top ten most significant repeat subfamilies identified. PlanTEnrichment is freely available at http://tools.ibg.deu.edu.tr/plantenrichment/ and can be used by researchers with rudimentary or no proficiency in computational analysis of TE elements, allowing for expedience in the identification of TEs of interest and helping further our understanding of the potential contributions of TEs in plant genomes.

摘要

转座元件 (TEs) 是重复元件,可在基因组中重新定位或自行复制新的拷贝,通过染色体重组或基因表达调控等事件,导致基因组的复杂性和扩张。然而,由于基因组中存在大量此类重复序列,即使在注释良好的基因组中,识别感兴趣的重复序列也是一项挑战,尤其是在更复杂、富含 TEs 的植物基因组中。在这里,我们描述了一种名为 PlanTEnrichment 的数据库创建协议,该数据库包含 11 种植物基因组的信息,可使用公开数据分析与应激相关的 TEs。用户可以通过选择一个基因组并提供一个希望识别其 TE 关联的基因或基因组区域列表,快速获得提供区域附近发现的 TE 亚家族,以及它们的超家族和类别,以及重复序列的富集值。结果还提供了发现的单个重复实例的位置,以及它们与输入区域或基因的关联,并提供了识别出的前十个最重要的重复亚家族的条形图。PlanTEnrichment 可在 http://tools.ibg.deu.edu.tr/plantenrichment/ 免费获得,即使是对 TE 元素的计算分析没有基本或没有专业知识的研究人员也可以使用,这有助于快速识别感兴趣的 TE,并帮助我们进一步了解 TE 在植物基因组中的潜在贡献。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验