• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CESAR 2.0 极大地提高了比较基因注释的速度和准确性。

CESAR 2.0 substantially improves speed and accuracy of comparative gene annotation.

机构信息

Max Planck Institute of Molecular Cell Biology and Genetics, Dresden 01307, Germany.

Max Planck Institute for the Physics of Complex Systems, Dresden 01187, Germany.

出版信息

Bioinformatics. 2017 Dec 15;33(24):3985-3987. doi: 10.1093/bioinformatics/btx527.

DOI:10.1093/bioinformatics/btx527
PMID:28961744
Abstract

MOTIVATION

Homology-based gene prediction is a powerful concept to annotate newly sequenced genomes. We have previously demonstrated that whole genome alignments can be utilized for accurate comparative coding gene annotation.

RESULTS

Here we present CESAR 2.0 that utilizes genome alignments to transfer coding gene annotations from one reference to many other aligned genomes. We show that CESAR 2.0 is 77 times faster and requires 31 times less memory compared to its predecessor. CESAR 2.0 substantially improves the ability to align splice sites that have shifted over larger distances, allowing for precise identification of the exon boundaries in the aligned genome. Finally, CESAR 2.0 supports entire genes, which enables the annotation of joined exons that arose by complete intron deletions. CESAR 2.0 can readily be applied to new genome alignments to annotate coding genes in many other genomes at improved accuracy and without necessitating large-computational resources.

AVAILABILITY AND IMPLEMENTATION

Source code is freely available at https://github.com/hillerlab/CESAR2.0.

CONTACT

hiller@mpi-cbg.de.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

基于同源性的基因预测是注释新测序基因组的强大概念。我们之前已经证明,全基因组比对可用于准确的比较编码基因注释。

结果

这里我们展示了 CESAR 2.0,它利用基因组比对将编码基因注释从一个参考基因组转移到许多其他对齐的基因组。我们表明,CESAR 2.0 比其前身快 77 倍,所需的内存少 31 倍。CESAR 2.0 极大地提高了对齐跨越较大距离的剪接位点的能力,从而能够精确识别对齐基因组中的外显子边界。最后,CESAR 2.0 支持整个基因,从而能够注释由完全内含子缺失产生的连接外显子。CESAR 2.0 可以轻松应用于新的基因组比对,以提高准确性并无需大量计算资源的情况下注释许多其他基因组中的编码基因。

可用性和实现

源代码可在 https://github.com/hillerlab/CESAR2.0 上免费获得。

联系人

hiller@mpi-cbg.de。

补充信息

补充数据可在 Bioinformatics 在线获得。

相似文献

1
CESAR 2.0 substantially improves speed and accuracy of comparative gene annotation.CESAR 2.0 极大地提高了比较基因注释的速度和准确性。
Bioinformatics. 2017 Dec 15;33(24):3985-3987. doi: 10.1093/bioinformatics/btx527.
2
Coding exon-structure aware realigner (CESAR) utilizes genome alignments for accurate comparative gene annotation.编码外显子结构感知重排器(CESAR)利用基因组比对进行准确的比较基因注释。
Nucleic Acids Res. 2016 Jun 20;44(11):e103. doi: 10.1093/nar/gkw210. Epub 2016 Mar 25.
3
Coding Exon-Structure Aware Realigner (CESAR): Utilizing Genome Alignments for Comparative Gene Annotation.编码外显子结构感知重排器(CESAR):利用基因组比对进行比较基因注释。
Methods Mol Biol. 2019;1962:179-191. doi: 10.1007/978-1-4939-9173-0_10.
4
Increased alignment sensitivity improves the usage of genome alignments for comparative gene annotation.提高比对灵敏度可改善基因组比对在比较基因注释中的应用。
Nucleic Acids Res. 2017 Aug 21;45(14):8369-8377. doi: 10.1093/nar/gkx554.
5
chainCleaner improves genome alignment specificity and sensitivity.链清洁器提高了基因组比对的特异性和灵敏度。
Bioinformatics. 2017 Jun 1;33(11):1596-1603. doi: 10.1093/bioinformatics/btx024.
6
A genome alignment of 120 mammals highlights ultraconserved element variability and placenta-associated enhancers.120 种哺乳动物的基因组比对突出了超保守元件的可变性和胎盘相关增强子。
Gigascience. 2020 Jan 1;9(1). doi: 10.1093/gigascience/giz159.
7
Splice2Deep: An ensemble of deep convolutional neural networks for improved splice site prediction in genomic DNA.Splice2Deep:用于改进基因组DNA中剪接位点预测的深度卷积神经网络集成方法。
Gene. 2020 Dec;763S:100035. doi: 10.1016/j.gene.2020.100035. Epub 2020 May 13.
8
transAlign: using amino acids to facilitate the multiple alignment of protein-coding DNA sequences.transAlign:利用氨基酸促进蛋白质编码DNA序列的多重比对。
BMC Bioinformatics. 2005 Jun 22;6:156. doi: 10.1186/1471-2105-6-156.
9
FunGAP: Fungal Genome Annotation Pipeline using evidence-based gene model evaluation.FunGAP:基于证据的基因模型评估的真菌基因组注释流水线。
Bioinformatics. 2017 Sep 15;33(18):2936-2937. doi: 10.1093/bioinformatics/btx353.
10
Simultaneous gene finding in multiple genomes.在多个基因组中同时进行基因发现。
Bioinformatics. 2016 Nov 15;32(22):3388-3395. doi: 10.1093/bioinformatics/btw494. Epub 2016 Jul 27.

引用本文的文献

1
Conservation assessment of human splice site annotation based on a 470-genome alignment.基于470个基因组比对的人类剪接位点注释的保守性评估。
Nucleic Acids Res. 2025 Mar 20;53(6). doi: 10.1093/nar/gkaf184.
2
Combining DNA and protein alignments to improve genome annotation with LiftOn.结合DNA和蛋白质比对,利用LiftOn改进基因组注释。
Genome Res. 2025 Feb 14;35(2):311-325. doi: 10.1101/gr.279620.124.
3
Conservation assessment of human splice site annotation based on a 470-genome alignment.基于470个基因组比对的人类剪接位点注释的保守性评估
bioRxiv. 2025 Mar 15:2023.12.01.569581. doi: 10.1101/2023.12.01.569581.
4
High-quality haploid genomes corroborate 29 chromosomes and highly conserved synteny of genes in Hyles hawkmoths (Lepidoptera: Sphingidae).高质量的单倍体基因组证实了天蛾(鳞翅目:天蛾科)的 29 条染色体和高度保守的基因同线性。
BMC Genomics. 2023 Aug 7;24(1):443. doi: 10.1186/s12864-023-09506-y.
5
Integrating gene annotation with orthology inference at scale.大规模整合基因注释与直系同源推断。
Science. 2023 Apr 28;380(6643):eabn3107. doi: 10.1126/science.abn3107.
6
Thousands of human non-AUG extended proteoforms lack evidence of evolutionary selection among mammals.数千个人类非 AUG 延伸蛋白缺乏哺乳动物进化选择的证据。
Nat Commun. 2022 Dec 23;13(1):7910. doi: 10.1038/s41467-022-35595-6.
7
Birth-and-death evolution of ribonuclease 9 genes in Cetartiodactyla.Cetartiodactyla 中核糖核酸酶 9 基因的诞生与消亡进化。
Sci China Life Sci. 2023 May;66(5):1170-1182. doi: 10.1007/s11427-022-2195-x. Epub 2022 Nov 25.
8
Vision-related convergent gene losses reveal 's unknown role in the eye.相关视觉的趋同基因缺失揭示了“s”在眼睛中的未知作用。
Elife. 2022 Jun 21;11:e77999. doi: 10.7554/eLife.77999.
9
Gene losses in the common vampire bat illuminate molecular adaptations to blood feeding.普通吸血蝙蝠的基因缺失揭示了其对吸血行为的分子适应机制。
Sci Adv. 2022 Mar 25;8(12):eabm6494. doi: 10.1126/sciadv.abm6494.
10
Gene losses may contribute to subterranean adaptations in naked mole-rat and blind mole-rat.基因缺失可能有助于裸鼹鼠和盲鼹鼠的地下适应。
BMC Biol. 2022 Feb 17;20(1):44. doi: 10.1186/s12915-022-01243-0.