• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

三个豌豆蚜基因组组装的新基因注释允许对基因和基因家族进化进行比较分析。

New gene annotations for three pea aphid genome assemblies allow comparative analyses of genes and gene family evolution.

作者信息

Deem Kevin D, Brisson Jennifer A

机构信息

Department of Biology, University of Rochester, Rochester, NY, 14627.

出版信息

bioRxiv. 2025 May 13:2025.05.08.652899. doi: 10.1101/2025.05.08.652899.

DOI:10.1101/2025.05.08.652899
PMID:40462938
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12132395/
Abstract

Reliable genome annotation is crucial for analyses of gene function, conservation, and evolution. Factors such as the sequencing technology used to create the assembly and the amount of duplicated sequence within the genome of interest can have a large impact on the quality of gene annotations. In particular, short read-based assemblies tend to mis-assemble duplicated genes as single loci, a problem that requires additional long read sequencing to resolve. Pea aphids exhibit a high level of gene duplication, resulting in mis-assembly and mis-annotation of genes in the short read reference genome. Here, we re-annotate the pea aphid reference genome, along with two long read pea aphid genomes, to facilitate future analyses of gene duplication and function in pea aphids. We use an integrated approach, consolidating both and RNAseq-based annotations into unified gene models. The new annotations contain genes that were missing, mis-annotated, or mis-assembled in the reference, and are generally consistent across assemblies, showing very good agreement between the long read assemblies. Our annotation method is sensitive enough to refine existing gene models, uncovering alternatively used promoters and isoforms, and aids in finding gene duplications. These data provide a useful supplement to the existing reference annotations and a new comparative framework for discovery and analysis of gene function and duplication in this important emerging model insect.

摘要

可靠的基因组注释对于基因功能、保守性和进化分析至关重要。诸如用于创建组装的测序技术以及感兴趣基因组内重复序列的数量等因素,可能会对基因注释的质量产生重大影响。特别是,基于短读长的组装往往会将重复基因错误地组装为单个位点,这个问题需要额外的长读长测序来解决。豌豆蚜表现出高水平的基因重复,导致短读长参考基因组中的基因出现错误组装和错误注释。在这里,我们对豌豆蚜参考基因组以及两个长读长豌豆蚜基因组进行重新注释,以促进未来对豌豆蚜基因重复和功能的分析。我们采用一种综合方法,将基于 和RNAseq的注释整合到统一的基因模型中。新的注释包含参考基因组中缺失、错误注释或错误组装的基因,并且在各个组装之间总体上是一致的,在长读长组装之间显示出非常好的一致性。我们的注释方法足够灵敏,能够完善现有的基因模型,发现交替使用的启动子和异构体,并有助于发现基因重复。这些数据为现有的参考注释提供了有用的补充,并为这个重要的新兴模式昆虫中基因功能和重复的发现与分析提供了一个新的比较框架。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c30/12132395/340328bf78d1/nihpp-2025.05.08.652899v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c30/12132395/14fa370fd399/nihpp-2025.05.08.652899v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c30/12132395/5a08f31f6e0f/nihpp-2025.05.08.652899v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c30/12132395/39e39d42479a/nihpp-2025.05.08.652899v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c30/12132395/d80d91e445e0/nihpp-2025.05.08.652899v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c30/12132395/340328bf78d1/nihpp-2025.05.08.652899v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c30/12132395/14fa370fd399/nihpp-2025.05.08.652899v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c30/12132395/5a08f31f6e0f/nihpp-2025.05.08.652899v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c30/12132395/39e39d42479a/nihpp-2025.05.08.652899v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c30/12132395/d80d91e445e0/nihpp-2025.05.08.652899v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c30/12132395/340328bf78d1/nihpp-2025.05.08.652899v1-f0005.jpg

相似文献

1
New gene annotations for three pea aphid genome assemblies allow comparative analyses of genes and gene family evolution.三个豌豆蚜基因组组装的新基因注释允许对基因和基因家族进化进行比较分析。
bioRxiv. 2025 May 13:2025.05.08.652899. doi: 10.1101/2025.05.08.652899.
2
Annotation of transcription factors, chromatin-associated factors, and basal transcription machinery in the pea aphid, Acyrthosiphon pisum, and development of the ATFdb database, a resource for studies of transcriptional regulation.豌豆蚜(Acyrthosiphon pisum)中转录因子、染色质相关因子和基础转录机制的注释以及ATFdb数据库的开发,该数据库是转录调控研究的资源。
Insect Biochem Mol Biol. 2025 Feb;177:104217. doi: 10.1016/j.ibmb.2024.104217. Epub 2024 Nov 22.
3
Gene Family Evolution in the Pea Aphid Based on Chromosome-Level Genome Assembly.基于染色体水平基因组组装的豌豆蚜基因家族进化。
Mol Biol Evol. 2019 Oct 1;36(10):2143-2156. doi: 10.1093/molbev/msz138.
4
Using multiple reference genomes to identify and resolve annotation inconsistencies.使用多个参考基因组来识别和解决注释不一致性。
BMC Genomics. 2020 Apr 8;21(1):281. doi: 10.1186/s12864-020-6696-8.
5
Selection following Gene Duplication Shapes Recent Genome Evolution in the Pea Aphid Acyrthosiphon pisum.基因复制后的选择塑造了豌豆蚜 Acyrthosiphon pisum 近期的基因组进化。
Mol Biol Evol. 2020 Sep 1;37(9):2601-2615. doi: 10.1093/molbev/msaa110.
6
Evaluating long-read assemblers to assemble several aphididae genomes.评估长读长序列拼接软件以拼接多个蚜科基因组。
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf105.
7
A dual-genome microarray for the pea aphid, Acyrthosiphon pisum, and its obligate bacterial symbiont, Buchnera aphidicola.一种用于豌豆蚜(Acyrthosiphon pisum)及其专性细菌共生体蚜虫内共生菌(Buchnera aphidicola)的双基因组微阵列。
BMC Genomics. 2006 Mar 14;7:50. doi: 10.1186/1471-2164-7-50.
8
Genome assembly has a major impact on gene content: a comparison of annotation in two Bos taurus assemblies.基因组组装对基因组成有重大影响:两个牛属基因组组装中注释的比较。
PLoS One. 2011;6(6):e21400. doi: 10.1371/journal.pone.0021400. Epub 2011 Jun 22.
9
Chromosome-Scale Genome Assemblies of Aphids Reveal Extensively Rearranged Autosomes and Long-Term Conservation of the X Chromosome.蚜虫染色体水平基因组组装揭示了广泛重排的常染色体和 X 染色体的长期保守性。
Mol Biol Evol. 2021 Mar 9;38(3):856-875. doi: 10.1093/molbev/msaa246.
10
Genome sequence of the pea aphid Acyrthosiphon pisum.豌豆蚜 Acyrthosiphon pisum 的基因组序列。
PLoS Biol. 2010 Feb 23;8(2):e1000313. doi: 10.1371/journal.pbio.1000313.

本文引用的文献

1
Long-read genome sequencing resolves complex genomic rearrangements in rare genetic syndromes.长读长基因组测序可解析罕见遗传综合征中的复杂基因组重排。
NPJ Genom Med. 2024 Dec 18;9(1):66. doi: 10.1038/s41525-024-00454-4.
2
OrthoDB and BUSCO update: annotation of orthologs with wider sampling of genomes.OrthoDB和BUSCO更新:通过更广泛的基因组采样对直系同源基因进行注释。
Nucleic Acids Res. 2025 Jan 6;53(D1):D516-D522. doi: 10.1093/nar/gkae987.
3
Duplications and Retrogenes Are Numerous and Widespread in Modern Canine Genomic Assemblies.
现代犬基因组中存在大量重复序列和返基因。
Genome Biol Evol. 2024 Jul 3;16(7). doi: 10.1093/gbe/evae142.
4
BRAKER3: Fully automated genome annotation using RNA-seq and protein evidence with GeneMark-ETP, AUGUSTUS, and TSEBRA.BRAKER3:利用 RNA-seq 和蛋白质证据,通过 GeneMark-ETP、AUGUSTUS 和 TSEBRA 进行全自动基因组注释。
Genome Res. 2024 Jun 25;34(5):769-777. doi: 10.1101/gr.278090.123.
5
The Galaxy platform for accessible, reproducible, and collaborative data analyses: 2024 update.Galaxy 平台,用于可访问、可重现和协作的数据分析:2024 年更新。
Nucleic Acids Res. 2024 Jul 5;52(W1):W83-W94. doi: 10.1093/nar/gkae410.
6
Long-read technologies identify a hidden inverted duplication in a family with choroideremia.长读长技术在一个患有脉络膜视网膜炎的家族中发现了一个隐藏的反向重复序列。
HGG Adv. 2021 Jul 20;2(4):100046. doi: 10.1016/j.xhgg.2021.100046. eCollection 2021 Oct 14.
7
BUSCO Update: Novel and Streamlined Workflows along with Broader and Deeper Phylogenetic Coverage for Scoring of Eukaryotic, Prokaryotic, and Viral Genomes.BUSCO 更新:用于真核生物、原核生物和病毒基因组评分的新颖且简化的工作流程以及更广泛和更深的系统发育覆盖范围。
Mol Biol Evol. 2021 Sep 27;38(10):4647-4654. doi: 10.1093/molbev/msab199.
8
Impact of short-read sequencing on the misassembly of a plant genome.短读测序对植物基因组组装错误的影响。
BMC Genomics. 2021 Feb 2;22(1):99. doi: 10.1186/s12864-021-07397-5.
9
Long-read human genome sequencing and its applications.长读长基因组测序及其应用。
Nat Rev Genet. 2020 Oct;21(10):597-614. doi: 10.1038/s41576-020-0236-x. Epub 2020 Jun 5.
10
GFF Utilities: GffRead and GffCompare.
F1000Res. 2020 Apr 28;9. doi: 10.12688/f1000research.23297.2. eCollection 2020.