Suppr超能文献

反向重复序列数据库:一个跨生物体基因组的反向重复序列数据库。

invertiaDB: A Database of Inverted Repeats Across Organismal Genomes.

作者信息

Provatas Kimonas, Chantzi Nikol, Patsakis Michail, Nayak Akshatha, Mouratidis Ioannis, Pavlopoulos Georgios A, Georgakopoulos-Soares Ilias

机构信息

Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA.

Huck Institute of the Life Sciences, Pennsylvania State University, University Park, PA, USA.

出版信息

bioRxiv. 2024 Nov 13:2024.11.11.622808. doi: 10.1101/2024.11.11.622808.

Abstract

Inverted repeats are repetitive elements that can form hairpin and cruciform structures. They are linked to genomic instability, however they also have various biological functions. Their distribution differs markedly across taxonomic groups in the tree of life, and they exhibit high polymorphism due to their inherent genomic instability. Advances in sequencing technologies and declined costs have enabled the generation of an ever-growing number of complete genomes for organisms across taxonomic groups in the tree of life. However, a comprehensive database encompassing inverted repeats across diverse organismal genomes has been lacking. We present InvertiaDB, the first comprehensive database of inverted repeats spanning multiple taxa, featuring repeats identified in the genomes of 118,070 organisms across all major taxonomic groups. The database currently hosts 30,067,666 inverted repeat sequences, serving as a centralized, user-friendly repository to perform searches, interactive visualization, and download existing inverted repeat data for independent analysis. invertiaDB is implemented as a web portal for browsing, analyzing and downloading inverted repeat data. invertiaDB is publicly available at https://invertiadb.netlify.app/homepage.html.

摘要

反向重复序列是能够形成发夹结构和十字形结构的重复元件。它们与基因组不稳定性相关,然而它们也具有多种生物学功能。它们在生命之树中的分类群间分布差异显著,并且由于其固有的基因组不稳定性而表现出高度多态性。测序技术的进步和成本的下降使得能够为生命之树中各个分类群的生物体生成越来越多的完整基因组。然而,一直缺乏一个涵盖不同生物体基因组中反向重复序列的综合数据库。我们展示了InvertiaDB,这是第一个涵盖多个分类群的反向重复序列综合数据库,其特点是包含在所有主要分类群的118,070个生物体基因组中鉴定出的重复序列。该数据库目前包含30,067,666个反向重复序列,作为一个集中的、用户友好的资源库,用于执行搜索、交互式可视化以及下载现有的反向重复序列数据以进行独立分析。InvertiaDB被实现为一个用于浏览、分析和下载反向重复序列数据的门户网站。InvertiaDB可在https://invertiadb.netlify.app/homepage.html上公开获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5048/11601276/f90367b0f8e4/nihpp-2024.11.11.622808v1-f0001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验