• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

结合DNA和蛋白质比对,利用LiftOn改进基因组注释。

Combining DNA and protein alignments to improve genome annotation with LiftOn.

作者信息

Chao Kuan-Hao, Heinz Jakob M, Hoh Celine, Mao Alan, Shumate Alaina, Pertea Mihaela, Salzberg Steven L

机构信息

Department of Computer Science, Johns Hopkins University, Baltimore, Maryland 21218, USA;

Center for Computational Biology, Johns Hopkins University, Baltimore, Maryland 21218, USA.

出版信息

Genome Res. 2025 Feb 14;35(2):311-325. doi: 10.1101/gr.279620.124.

DOI:10.1101/gr.279620.124
PMID:39730188
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11874971/
Abstract

As the number and variety of assembled genomes continue to grow, the number of annotated genomes is falling behind, particularly for eukaryotes. DNA-based mapping tools help to address this challenge, but they are only able to transfer annotation between closely related species. Here we introduce LiftOn, a homology-based software tool that integrates DNA and protein alignments to enhance the accuracy of genome-scale annotation and to allow mapping between relatively distant species. LiftOn's protein-centric algorithm considers both types of alignments, chooses optimal open reading frames, resolves overlapping gene loci, and finds additional gene copies when they exist. LiftOn can reliably transfer annotation between genomes representing members of the same species, as we demonstrate on human, mouse, honeybee, rice, and It can further map annotation effectively across species pairs as far apart as mouse and rat or and .

摘要

随着已组装基因组的数量和种类不断增加,注释基因组的数量却滞后了,尤其是对于真核生物而言。基于DNA的映射工具有助于应对这一挑战,但它们只能在亲缘关系密切的物种之间转移注释。在此,我们介绍LiftOn,这是一种基于同源性的软件工具,它整合了DNA和蛋白质比对,以提高基因组规模注释的准确性,并允许在亲缘关系相对较远的物种之间进行映射。LiftOn以蛋白质为中心的算法考虑了两种比对类型,选择最佳开放阅读框,解析重叠基因座,并在存在额外基因拷贝时找到它们。正如我们在人类、小鼠、蜜蜂、水稻以及……上所展示的,LiftOn能够在代表同一物种成员的基因组之间可靠地转移注释。它还能进一步有效地跨物种对进行注释映射,比如小鼠和大鼠或……和……之间。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c020/11874971/8d25c7362380/311f05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c020/11874971/0140289074e6/311f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c020/11874971/b0a4ef3bcdb3/311f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c020/11874971/a3fc012203b8/311f03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c020/11874971/7dee245f58db/311f04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c020/11874971/8d25c7362380/311f05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c020/11874971/0140289074e6/311f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c020/11874971/b0a4ef3bcdb3/311f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c020/11874971/a3fc012203b8/311f03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c020/11874971/7dee245f58db/311f04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c020/11874971/8d25c7362380/311f05.jpg

相似文献

1
Combining DNA and protein alignments to improve genome annotation with LiftOn.结合DNA和蛋白质比对,利用LiftOn改进基因组注释。
Genome Res. 2025 Feb 14;35(2):311-325. doi: 10.1101/gr.279620.124.
2
Combining DNA and protein alignments to improve genome annotation with LiftOn.结合DNA和蛋白质比对,利用LiftOn改进基因组注释。
bioRxiv. 2024 May 17:2024.05.16.593026. doi: 10.1101/2024.05.16.593026.
3
Can a Liquid Biopsy Detect Circulating Tumor DNA With Low-passage Whole-genome Sequencing in Patients With a Sarcoma? A Pilot Evaluation.液体活检能否通过低深度全基因组测序检测肉瘤患者的循环肿瘤DNA?一项初步评估。
Clin Orthop Relat Res. 2025 Jan 1;483(1):39-48. doi: 10.1097/CORR.0000000000003161. Epub 2024 Jun 21.
4
Short-Term Memory Impairment短期记忆障碍
5
Fabricating mice and dementia: opening up relations in multi-species research制造小鼠与痴呆症:开启多物种研究中的关联
6
High-Performance Genome Annotation for a Safer and Faster-Developing Phage Therapy.用于更安全、发展更快的噬菌体疗法的高性能基因组注释。
Viruses. 2025 Feb 25;17(3):314. doi: 10.3390/v17030314.
7
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
8
Sexual Harassment and Prevention Training性骚扰与预防培训
9
Home treatment for mental health problems: a systematic review.心理健康问题的居家治疗:一项系统综述
Health Technol Assess. 2001;5(15):1-139. doi: 10.3310/hta5150.
10
Alevin-fry-atac enables rapid and memory frugal mapping of single-cell ATAC-seq data using virtual colors for accurate genomic pseudoalignment.Alevin-fry-atac可使用虚拟颜色实现单细胞ATAC-seq数据的快速且节省内存的映射,以进行准确的基因组伪比对。
Bioinformatics. 2025 Jul 1;41(Supplement_1):i237-i245. doi: 10.1093/bioinformatics/btaf234.

本文引用的文献

1
LiftoffTools: a toolkit for comparing gene annotations mapped between genome assemblies.LiftoffTools:一个用于比较基因组组装之间映射的基因注释的工具包。
F1000Res. 2024 Apr 29;11:1230. doi: 10.12688/f1000research.124059.2. eCollection 2022.
2
CHESS 3: an improved, comprehensive catalog of human genes and transcripts based on large-scale expression data, phylogenetic analysis, and protein structure.CHESS 3:基于大规模表达数据、系统发育分析和蛋白质结构,改进和综合的人类基因和转录本目录。
Genome Biol. 2023 Oct 30;24(1):249. doi: 10.1186/s13059-023-03088-4.
3
The complete sequence of a human Y chromosome.
人类 Y 染色体的完整序列。
Nature. 2023 Sep;621(7978):344-354. doi: 10.1038/s41586-023-06457-y. Epub 2023 Aug 23.
4
Reference genomes for BALB/c Nude and NOD/SCID mouse models.BALB/c Nude 和 NOD/SCID 小鼠模型的参考基因组。
G3 (Bethesda). 2023 Sep 30;13(10). doi: 10.1093/g3journal/jkad188.
5
A complete assembly of the rice Nipponbare reference genome.日本晴水稻参考基因组的完整组装。
Mol Plant. 2023 Aug 7;16(8):1232-1236. doi: 10.1016/j.molp.2023.08.003.
6
Genomic sketching with multiplicities and locality-sensitive hashing using Dashing 2.使用 Dashing 2 进行多重性和位置敏感哈希的基因组草图绘制。
Genome Res. 2023 Jul;33(7):1218-1227. doi: 10.1101/gr.277655.123. Epub 2023 Jul 6.
7
Protein-to-genome alignment with miniprot.用 Miniprot 进行蛋白质到基因组的比对。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btad014.
8
Method of the year: long-read sequencing.年度方法:长读长测序。
Nat Methods. 2023 Jan;20(1):6-11. doi: 10.1038/s41592-022-01730-w.
9
Approaching complete genomes, transcriptomes and epi-omes with accurate long-read sequencing.采用准确的长读测序技术获取完整的基因组、转录组和表观基因组。
Nat Methods. 2023 Jan;20(1):12-16. doi: 10.1038/s41592-022-01716-8.
10
The first gapless, reference-quality, fully annotated genome from a Southern Han Chinese individual.首个无缺口、参考质量、完整注释的中国南方汉族个体基因组。
G3 (Bethesda). 2023 Mar 9;13(3). doi: 10.1093/g3journal/jkac321.