
Comparative Annotation Toolkit (CAT)-simultaneous clade and personal genome annotation.

Affiliations

Genomics Institute, University of California Santa Cruz and Howard Hughes Medical Institute, Santa Cruz, California 95064, USA.

10x Genomics, Pleasanton, California 94566, USA.

Publication information

Genome Res. 2018 Jul;28(7):1029-1038. doi: 10.1101/gr.233460.117. Epub 2018 Jun 8.

DOI: 10.1101/gr.233460.117
PMID: 29884752
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6028123/
Abstract

The recent introductions of low-cost, long-read, and read-cloud sequencing technologies coupled with intense efforts to develop efficient algorithms have made affordable, high-quality de novo sequence assembly a realistic proposition. The result is an explosion of new, ultracontiguous genome assemblies. To compare these genomes, we need robust methods for genome annotation. We describe the fully open source Comparative Annotation Toolkit (CAT), which provides a flexible way to simultaneously annotate entire clades and identify orthology relationships. We show that CAT can be used to improve annotations on the rat genome, annotate the great apes, annotate a diverse set of mammals, and annotate personal, diploid human genomes. We demonstrate the resulting discovery of novel genes, isoforms, and structural variants, even in genomes as well studied as rat and the great apes, and how these annotations improve cross-species RNA expression experiments.

