Suppr超能文献

结合DNA和蛋白质比对,利用LiftOn改进基因组注释。

Combining DNA and protein alignments to improve genome annotation with LiftOn.

作者信息

Chao Kuan-Hao, Heinz Jakob M, Hoh Celine, Mao Alan, Shumate Alaina, Pertea Mihaela, Salzberg Steven L

机构信息

Department of Computer Science, Johns Hopkins University, Baltimore, MD 21218, USA.

Center for Computational Biology, Johns Hopkins University, Baltimore, MD 21218, USA.

出版信息

bioRxiv. 2024 May 17:2024.05.16.593026. doi: 10.1101/2024.05.16.593026.

Abstract

As the number and variety of assembled genomes continues to grow, the number of annotated genomes is falling behind, particularly for eukaryotes. DNA-based mapping tools help to address this challenge, but they are only able to transfer annotation between closely-related species. Here we introduce LiftOn, a homology-based software tool that integrates DNA and protein alignments to enhance the accuracy of genome-scale annotation and to allow mapping between relatively distant species. LiftOn's protein-centric algorithm considers both types of alignments, chooses optimal open reading frames, resolves overlapping gene loci, and finds additional gene copies where they exist. LiftOn can reliably transfer annotation between genomes representing members of the same species, as we demonstrate on human, mouse, honey bee, rice, and . It can further map annotation effectively across species pairs as far apart as mouse and rat or and .

摘要

随着已组装基因组的数量和种类不断增加,注释基因组的数量却滞后了,尤其是对于真核生物而言。基于DNA的映射工具有助于应对这一挑战,但它们只能在亲缘关系密切的物种之间转移注释。在此,我们介绍LiftOn,这是一种基于同源性的软件工具,它整合了DNA和蛋白质比对,以提高基因组规模注释的准确性,并允许在亲缘关系相对较远的物种之间进行映射。LiftOn以蛋白质为中心的算法会考虑两种比对类型,选择最佳开放阅读框,解析重叠基因座,并在存在额外基因拷贝的地方找到它们。正如我们在人类、小鼠、蜜蜂、水稻等物种上所展示的,LiftOn能够在代表同一物种成员的基因组之间可靠地转移注释。它还能进一步有效地跨物种对进行注释映射,比如小鼠和大鼠或其他物种对。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a4e/11118573/ce4444b6fe3a/nihpp-2024.05.16.593026v1-f0001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验