Provatas Kimonas, Chantzi Nikol, Patsakis Michail, Nayak Akshatha, Mouratidis Ioannis, Pavlopoulos Georgios A, Georgakopoulos-Soares Ilias
Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA.
Huck Institute of the Life Sciences, Pennsylvania State University, University Park, PA, USA.
bioRxiv. 2024 Nov 13:2024.11.11.622808. doi: 10.1101/2024.11.11.622808.
Inverted repeats are repetitive elements that can form hairpin and cruciform structures. They are linked to genomic instability, however they also have various biological functions. Their distribution differs markedly across taxonomic groups in the tree of life, and they exhibit high polymorphism due to their inherent genomic instability. Advances in sequencing technologies and declined costs have enabled the generation of an ever-growing number of complete genomes for organisms across taxonomic groups in the tree of life. However, a comprehensive database encompassing inverted repeats across diverse organismal genomes has been lacking. We present InvertiaDB, the first comprehensive database of inverted repeats spanning multiple taxa, featuring repeats identified in the genomes of 118,070 organisms across all major taxonomic groups. The database currently hosts 30,067,666 inverted repeat sequences, serving as a centralized, user-friendly repository to perform searches, interactive visualization, and download existing inverted repeat data for independent analysis. invertiaDB is implemented as a web portal for browsing, analyzing and downloading inverted repeat data. invertiaDB is publicly available at https://invertiadb.netlify.app/homepage.html.
反向重复序列是能够形成发夹结构和十字形结构的重复元件。它们与基因组不稳定性相关,然而它们也具有多种生物学功能。它们在生命之树中的分类群间分布差异显著,并且由于其固有的基因组不稳定性而表现出高度多态性。测序技术的进步和成本的下降使得能够为生命之树中各个分类群的生物体生成越来越多的完整基因组。然而,一直缺乏一个涵盖不同生物体基因组中反向重复序列的综合数据库。我们展示了InvertiaDB,这是第一个涵盖多个分类群的反向重复序列综合数据库,其特点是包含在所有主要分类群的118,070个生物体基因组中鉴定出的重复序列。该数据库目前包含30,067,666个反向重复序列,作为一个集中的、用户友好的资源库,用于执行搜索、交互式可视化以及下载现有的反向重复序列数据以进行独立分析。InvertiaDB被实现为一个用于浏览、分析和下载反向重复序列数据的门户网站。InvertiaDB可在https://invertiadb.netlify.app/homepage.html上公开获取。