Mendel Centre for Plant Genomics and Proteomics, Central European Institute of Technology (CEITEC), Masaryk University, Brno CZ-62500, Czech Republic.
National Centre for Biomolecular Research, Faculty of Science, Masaryk University, Brno CZ-62500, Czech Republic.
Nucleic Acids Res. 2024 Jan 5;52(D1):D311-D321. doi: 10.1093/nar/gkad672.
Discoveries over the past decade have demonstrated an unexpected diversity of telomere DNA motifs in nature. However, the currently available resources, the 'Telomerase database' and the 'Plant rDNA database', cover only fragments of the relevant literature published over decades of telomere research, because they have a different primary focus and receive limited updates. To fill this gap, we gathered data on telomere DNA sequences through a thorough literature screen and by analysing publicly available NGS data, and we created TeloBase (http://cfb.ceitec.muni.cz/telobase/), a comprehensive database of telomere motif diversity. TeloBase is supplemented by an internal taxonomy, built on popular online taxonomic resources, that enables in-house data filtering and graphical visualisation of telomere DNA evolutionary dynamics as heat tree plots. To avoid overreliance on administrators for future updates, TeloBase provides a simple form through which users submit new telomere sequences and a community-curation system through which those submissions are approved, which should keep the database current. To demonstrate the utility of TeloBase, we examined telomere motif diversity in species of the fungal genus Aspergillus and discovered the (TTTATTAGGG)n sequence as a putative telomere motif in the plant family Chrysobalanaceae. This finding was confirmed bioinformatically by analysing the template regions of identified telomerase RNAs.
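As a rough illustration of what screening sequence data for a tandemly repeated motif such as (TTTATTAGGG)n can involve, the following minimal Python sketch counts back-to-back copies of a motif in a read and normalises motifs so that rotations and reverse complements compare as equal. It is not the TeloBase pipeline: the function names (`reverse_complement`, `canonical_motif`, `longest_tandem_run`) and the example read are hypothetical and chosen only for illustration.

```python
import re

# Translation table for complementing unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def canonical_motif(motif: str) -> str:
    """Return the lexicographically smallest rotation of the motif or of
    its reverse complement, so that e.g. TTAGGG, GGGTTA and CCCTAA all
    map to the same key. (Hypothetical helper, not from TeloBase.)"""
    motif = motif.upper()
    if not motif:
        raise ValueError("motif must not be empty")
    candidates = []
    for strand in (motif, reverse_complement(motif)):
        for i in range(len(strand)):
            candidates.append(strand[i:] + strand[:i])
    return min(candidates)


def longest_tandem_run(read: str, motif: str) -> int:
    """Return the largest number of consecutive copies of `motif` found in
    `read`, on the given strand and in the given phase only."""
    motif = motif.upper()
    if not motif:
        raise ValueError("motif must not be empty")
    pattern = re.compile(f"(?:{re.escape(motif)})+")
    runs = [len(m.group(0)) // len(motif) for m in pattern.finditer(read.upper())]
    return max(runs, default=0)


if __name__ == "__main__":
    # Made-up read: a short prefix followed by three copies of the motif.
    read = "ACGT" + "TTTATTAGGG" * 3 + "TTTA"
    print(longest_tandem_run(read, "TTTATTAGGG"))
    print(canonical_motif("TTAGGG") == canonical_motif("CCCTAA"))
```

A real screen would also need to handle sequencing errors, both strands, and all phases of the repeat; the canonical form here only addresses the bookkeeping problem of the same motif being reported in different rotations or orientations.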