• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Evolutionary and methodological considerations when interpreting gene presence-absence variation in pangenomes.

作者信息

Brůna Tomáš, Sreedasyam Avinash, Harder Avril M, Lovell John T

机构信息

DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, United States.

Genome Sequencing Center, HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, United States.

出版信息

NAR Genom Bioinform. 2026 Jan 30;8(1):lqag011. doi: 10.1093/nargab/lqag011. eCollection 2026 Mar.

DOI:10.1093/nargab/lqag011
PMID:41625147
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12856356/
Abstract

While graph-based pangenomes have become a standard and interoperable foundation for comparisons across multiple reference genomes, integrating protein-coding gene annotations across pangenomes in a single 'pangene set' remains challenging, both because of methodological inconsistency and biological presence-absence variation (PAV). Here, we review and experimentally evaluate the root of genome annotation and pangene set inconsistency using two polyploid plant pangenomes: cotton and soybean, which were chosen because of their existing diverse high-quality genomic resources and the known importance of gene PAV in their respective breeding programs. We first demonstrate that building pangene sets across different genome resources is highly error prone: PAV calculated directly from the genome annotations hosted on public repositories recapitulates structure in annotation methods and not biological sequence differences. Re-annotation of all genomes with a single identical pipeline largely resolves the broadest stroke issues; however, substantial challenges remain, including a surprisingly common case where exactly identical sequences have different gene model structural annotations. Combined, these results clearly show that pangenome gene model annotations must be carefully integrated before any biological inference can be made regarding sequence evolution, gene copy-number, or PAV.

摘要

相似文献

1
Evolutionary and methodological considerations when interpreting gene presence-absence variation in pangenomes.
NAR Genom Bioinform. 2026 Jan 30;8(1):lqag011. doi: 10.1093/nargab/lqag011. eCollection 2026 Mar.
2
Developing pangenomes for large and complex plant genomes and their representation formats.为大型复杂植物基因组开发泛基因组及其表示格式。
J Adv Res. 2025 Jan 31. doi: 10.1016/j.jare.2025.01.052.
3
A stepwise guide for pangenome development in crop plants: an alfalfa (Medicago sativa) case study.作物泛基因组开发的分步指南:以紫花苜蓿(Medicago sativa)为例。
BMC Genomics. 2024 Oct 31;25(1):1022. doi: 10.1186/s12864-024-10931-w.
4
Towards plant pangenomics.迈向植物泛基因组学。
Plant Biotechnol J. 2016 Apr;14(4):1099-105. doi: 10.1111/pbi.12499. Epub 2015 Nov 23.
5
Pangenomics in crop improvement-from coding structural variations to finding regulatory variants with pangenome graphs.作物改良中的泛基因组学——从编码结构变异到利用泛基因组图谱寻找调控变异
Plant Genome. 2022 Mar;15(1):e20177. doi: 10.1002/tpg2.20177. Epub 2021 Dec 13.
6
Brassica Panache: A multi-species graph pangenome representing presence absence variation across forty-one Brassica genomes.羽衣甘蓝:一个多物种图形泛基因组,代表了41个甘蓝型油菜基因组中的存在缺失变异。
Plant Genome. 2025 Mar;18(1):e20535. doi: 10.1002/tpg2.20535. Epub 2024 Dec 8.
7
Cotton pan-genome retrieves the lost sequences and genes during domestication and selection.棉花泛基因组找回了在驯化和选择过程中丢失的序列和基因。
Genome Biol. 2021 Apr 23;22(1):119. doi: 10.1186/s13059-021-02351-w.
8
What Are We Learning from Plant Pangenomes?我们从植物泛基因组中学到了什么?
Annu Rev Plant Biol. 2025 May;76(1):663-686. doi: 10.1146/annurev-arplant-090823-015358. Epub 2024 Dec 2.
9
A pangenome analysis pipeline provides insights into functional gene identification in rice.泛基因组分析流程为水稻功能基因鉴定提供了新视角。
Genome Biol. 2023 Jan 26;24(1):19. doi: 10.1186/s13059-023-02861-9.
10
Toward a standardized framework for pangenome graph evaluation: assessing crop plant pangenome variation graph construction from multiple assemblies.
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf121.

本文引用的文献

1
Helixer: ab initio prediction of primary eukaryotic gene models combining deep learning and a hidden Markov model.
Nat Methods. 2025 Nov 24. doi: 10.1038/s41592-025-02939-1.
2
Lost in translation: What we have learned from attributes that do not translate from Arabidopsis to other plants.翻译中的迷失:我们从无法从拟南芥转移至其他植物的特性中学到了什么。
Plant Cell. 2025 May 9;37(5). doi: 10.1093/plcell/koaf036.
3
Polyploidy-mediated variations in glutamate receptor proteins linked to Fusarium wilt resistance in upland cotton.多倍体介导的谷氨酸受体蛋白变异与陆地棉枯萎病抗性相关
Plant J. 2025 Apr;122(1):e70125. doi: 10.1111/tpj.70125.
4
PSAURON: a tool for assessing protein annotation across a broad range of species.PSAURON:一种用于评估广泛物种中蛋白质注释的工具。
NAR Genom Bioinform. 2025 Jan 7;7(1):lqae189. doi: 10.1093/nargab/lqae189. eCollection 2025 Mar.
5
Panacus: fast and exact pangenome growth and core size estimation.人参属:快速准确的泛基因组增长及核心大小估计
Bioinformatics. 2024 Nov 28;40(12). doi: 10.1093/bioinformatics/btae720.
6
What Are We Learning from Plant Pangenomes?我们从植物泛基因组中学到了什么?
Annu Rev Plant Biol. 2025 May;76(1):663-686. doi: 10.1146/annurev-arplant-090823-015358. Epub 2024 Dec 2.
7
Assembly, comparative analysis, and utilization of a single haplotype reference genome for soybean.大豆单倍型参考基因组的组装、比较分析和利用。
Plant J. 2024 Nov;120(3):1221-1235. doi: 10.1111/tpj.17026. Epub 2024 Sep 14.
8
Exploring crop genomes: assembly features, gene prediction accuracy, and implications for proteomics studies.探索作物基因组:组装特征、基因预测准确性及其对蛋白质组学研究的影响。
BMC Genomics. 2024 Jun 19;25(1):619. doi: 10.1186/s12864-024-10521-w.
9
BRAKER3: Fully automated genome annotation using RNA-seq and protein evidence with GeneMark-ETP, AUGUSTUS, and TSEBRA.BRAKER3:利用 RNA-seq 和蛋白质证据,通过 GeneMark-ETP、AUGUSTUS 和 TSEBRA 进行全自动基因组注释。
Genome Res. 2024 Jun 25;34(5):769-777. doi: 10.1101/gr.278090.123.
10
GeneMark-ETP significantly improves the accuracy of automatic annotation of large eukaryotic genomes.GeneMark-ETP 显著提高了大型真核基因组自动注释的准确性。
Genome Res. 2024 Jun 25;34(5):757-768. doi: 10.1101/gr.278373.123.