• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

新开发的基于结构的方法在大规模系统发育基因组学方面并不优于标准的基于序列的方法。

Newly Developed Structure-Based Methods Do Not Outperform Standard Sequence-Based Methods for Large-Scale Phylogenomics.

作者信息

Mutti Giacomo, Ocaña-Pallarès Eduard, Gabaldón Toni

机构信息

Barcelona Supercomputing Centre (BSC-CNS), Barcelona 08034, Spain.

Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona 08028, Spain.

出版信息

Mol Biol Evol. 2025 Jul 1;42(7). doi: 10.1093/molbev/msaf149.

DOI:10.1093/molbev/msaf149
PMID:40580945
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12290511/
Abstract

Recent developments in protein structure prediction have allowed the use of this previously limited source of information at genome-wide scales. It has been proposed that the use of structural information may offer advantages over sequences in phylogenetic reconstruction, due to their slower rate of evolution and direct correlation to function. Here, we examined how recently developed methods for structure-based homology search and tree reconstruction compare with current state-of-the-art sequence-based methods in reconstructing genome-wide collections of gene phylogenies (i.e. phylomes). While structure-based methods can be useful in specific scenarios, we found that their current performance does not justify using the newly developed structure-based methods as a default choice in large-scale phylogenetic studies. On the one hand, the best performing sequence-based tree reconstruction methods still outperform structure-based methods for this task. On the other hand, structure-based homology detection methods provide larger lists of candidate homologs, as previously reported. However, this comes at the expense of missing hits identified by sequence-based methods, as well as providing sets of homolog candidates with higher fractions of false positives. These insights help to guide the use of structural data in comparative genomics and highlight the need to continue improving structure-based approaches. Our pipeline is fully reproducible and has been implemented in a Snakemake workflow. This will facilitate a continuous assessment of future improvements of structure-based tools in the AlphaFold era.

摘要

蛋白质结构预测的最新进展使得在全基因组范围内能够利用这一先前有限的信息来源。有人提出,由于结构信息的进化速度较慢且与功能直接相关,因此在系统发育重建中使用结构信息可能比序列具有优势。在此,我们研究了最近开发的基于结构的同源性搜索和树重建方法与当前基于序列的最先进方法在重建全基因组基因系统发育集合(即系统发育组)方面的比较情况。虽然基于结构的方法在特定情况下可能有用,但我们发现它们目前的性能并不足以证明在大规模系统发育研究中将新开发的基于结构的方法作为默认选择是合理的。一方面,对于这项任务,性能最佳的基于序列的树重建方法仍然优于基于结构的方法。另一方面,如先前报道的那样,基于结构的同源性检测方法提供了更大的候选同源物列表。然而,这是以错过基于序列的方法识别的命中结果为代价的,同时还提供了具有更高假阳性率的同源候选物集。这些见解有助于指导在比较基因组学中使用结构数据,并突出了继续改进基于结构的方法的必要性。我们的流程是完全可重复的,并且已在Snakemake工作流程中实现。这将有助于在AlphaFold时代对基于结构的工具的未来改进进行持续评估。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/914f/12290511/b37522b3e32c/msaf149f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/914f/12290511/7b88a3eb361c/msaf149f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/914f/12290511/8fb64b5036dc/msaf149f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/914f/12290511/b37522b3e32c/msaf149f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/914f/12290511/7b88a3eb361c/msaf149f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/914f/12290511/8fb64b5036dc/msaf149f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/914f/12290511/b37522b3e32c/msaf149f3.jpg

相似文献

1
Newly Developed Structure-Based Methods Do Not Outperform Standard Sequence-Based Methods for Large-Scale Phylogenomics.新开发的基于结构的方法在大规模系统发育基因组学方面并不优于标准的基于序列的方法。
Mol Biol Evol. 2025 Jul 1;42(7). doi: 10.1093/molbev/msaf149.
2
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.
3
Systemic Inflammatory Response Syndrome全身炎症反应综合征
4
Management of urinary stones by experts in stone disease (ESD 2025).结石病专家对尿路结石的管理(2025年结石病专家共识)
Arch Ital Urol Androl. 2025 Jun 30;97(2):14085. doi: 10.4081/aiua.2025.14085.
5
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
6
The Lived Experience of Autistic Adults in Employment: A Systematic Search and Synthesis.成年自闭症患者的就业生活经历:系统检索与综述
Autism Adulthood. 2024 Dec 2;6(4):495-509. doi: 10.1089/aut.2022.0114. eCollection 2024 Dec.
7
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状Meta分析。
Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.
8
"In a State of Flow": A Qualitative Examination of Autistic Adults' Phenomenological Experiences of Task Immersion.“心流状态”:对自闭症成年人任务沉浸现象学体验的质性研究
Autism Adulthood. 2024 Sep 16;6(3):362-373. doi: 10.1089/aut.2023.0032. eCollection 2024 Sep.
9
Direct composite resin fillings versus amalgam fillings for permanent posterior teeth.直接复合树脂充填与银汞合金充填用于永久性后牙。
Cochrane Database Syst Rev. 2021 Aug 13;8(8):CD005620. doi: 10.1002/14651858.CD005620.pub3.
10
The educational effects of portfolios on undergraduate student learning: a Best Evidence Medical Education (BEME) systematic review. BEME Guide No. 11.档案袋对本科学生学习的教育效果:最佳证据医学教育(BEME)系统评价。BEME指南第11号。
Med Teach. 2009 Apr;31(4):282-98. doi: 10.1080/01421590902889897.

引用本文的文献

1
Protein Structural Phylogenetics.蛋白质结构系统发育学
Genome Biol Evol. 2025 Jul 30;17(8). doi: 10.1093/gbe/evaf139.
2
The Evolutionary History and Modern Diversity of Triterpenoid Cyclases.三萜环化酶的进化史与现代多样性
Mol Biol Evol. 2025 Sep 1;42(9). doi: 10.1093/molbev/msaf203.
3
The evolutionary history and modern diversity of triterpenoid cyclases.三萜环化酶的进化史与现代多样性

本文引用的文献

1
A General Substitution Matrix for Structural Phylogenetics.一种用于结构系统发育学的通用替换矩阵。
Mol Biol Evol. 2025 Jun 4;42(6). doi: 10.1093/molbev/msaf124.
2
AlphaFold Protein Structure Database in 2024: providing structure coverage for over 214 million protein sequences.2024 年的 AlphaFold 蛋白质结构数据库:为超过 2.14 亿个蛋白质序列提供结构覆盖。
Nucleic Acids Res. 2024 Jan 5;52(D1):D368-D375. doi: 10.1093/nar/gkad1011.
3
Remote homolog detection places insect chemoreceptors in a cryptic protein superfamily spanning the tree of life.
bioRxiv. 2025 Aug 2:2024.10.28.620730. doi: 10.1101/2024.10.28.620730.
远程同源检测将昆虫化感器置于跨越生命之树的隐蔽蛋白超家族中。
Curr Biol. 2023 Nov 20;33(22):5023-5033.e4. doi: 10.1016/j.cub.2023.10.008. Epub 2023 Oct 31.
4
Fast and accurate protein structure search with Foldseek.使用 Foldseek 进行快速准确的蛋白质结构搜索。
Nat Biotechnol. 2024 Feb;42(2):243-246. doi: 10.1038/s41587-023-01773-0. Epub 2023 May 8.
5
UniProt: the Universal Protein Knowledgebase in 2023.UniProt:2023 年的通用蛋白质知识库。
Nucleic Acids Res. 2023 Jan 6;51(D1):D523-D531. doi: 10.1093/nar/gkac1052.
6
Highly accurate protein structure prediction with AlphaFold.利用 AlphaFold 进行高精度蛋白质结构预测。
Nature. 2021 Aug;596(7873):583-589. doi: 10.1038/s41586-021-03819-2. Epub 2021 Jul 15.
7
Sustainable data analysis with Snakemake.使用 Snakemake 进行可持续数据分析。
F1000Res. 2021 Jan 18;10:33. doi: 10.12688/f1000research.29032.2. eCollection 2021.
8
IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era.IQ-TREE 2:基因组时代系统发育推断的新模型和有效方法。
Mol Biol Evol. 2020 May 1;37(5):1530-1534. doi: 10.1093/molbev/msaa015.
9
eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses.eggNOG 5.0:一个基于 5090 个生物体和 2502 种病毒的层次化、功能和系统发育注释的同源资源。
Nucleic Acids Res. 2019 Jan 8;47(D1):D309-D314. doi: 10.1093/nar/gky1085.
10
FastME 2.0: A Comprehensive, Accurate, and Fast Distance-Based Phylogeny Inference Program.FastME 2.0:一个全面、准确、快速的基于距离的系统发育推断程序。
Mol Biol Evol. 2015 Oct;32(10):2798-800. doi: 10.1093/molbev/msv150. Epub 2015 Jun 30.