• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过加权持久同调对蛋白质结构进行忠实解释可改进进化距离估计。

Faithful Interpretation of Protein Structures through Weighted Persistent Homology Improves Evolutionary Distance Estimation.

作者信息

Bou Dagher Léa, Madern Dominique, Malbos Philippe, Brochier-Armanet Céline

机构信息

Universite Claude Bernard Lyon 1, LBBE, UMR 5558, CNRS, VAS, Villeurbanne F-69622, France.

Université Claude Bernard Lyon 1, CNRS, Institut Camille Jordan, UMR5208, Villeurbanne F-69622, France.

出版信息

Mol Biol Evol. 2025 Feb 3;42(2). doi: 10.1093/molbev/msae271.

DOI:10.1093/molbev/msae271
PMID:39761698
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11789942/
Abstract

Phylogenetic inference is mainly based on sequence analysis and requires reliable alignments. This can be challenging, especially when sequences are highly divergent. In this context, the use of three-dimensional protein structures is a promising alternative. In a recent study, we introduced an original topological data analysis method based on persistent homology to estimate the evolutionary distances from structures. The method was successfully tested on 518 protein families representing 22,940 predicted structures. However, as anticipated, the reliability of the estimated evolutionary distances was impacted by the quality of the predicted structures and the presence of indels in the proteins. This paper introduces a new topological descriptor, called bio-topological marker (BTM), which provides a more faithful description of the structures, a topological analysis for estimating evolutionary distances from BTMs, and a new weight-filtering method adapted to protein structures. These new developments significantly improve the estimation of evolutionary distances and phylogenies inferred from structures.

摘要

系统发育推断主要基于序列分析,需要可靠的比对。这可能具有挑战性,尤其是当序列高度分化时。在这种情况下,使用三维蛋白质结构是一种很有前景的替代方法。在最近的一项研究中,我们引入了一种基于持久同调的原始拓扑数据分析方法,以从结构中估计进化距离。该方法在代表22940个预测结构的518个蛋白质家族上成功进行了测试。然而,正如预期的那样,估计的进化距离的可靠性受到预测结构的质量和蛋白质中插入缺失的影响。本文介绍了一种新的拓扑描述符,称为生物拓扑标记(BTM),它提供了对结构更忠实的描述、一种用于从BTM估计进化距离的拓扑分析,以及一种适用于蛋白质结构的新权重过滤方法。这些新进展显著改进了从结构推断的进化距离和系统发育的估计。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/50e7/11789942/c13816ce4a9c/msae271f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/50e7/11789942/7e1f0775755d/msae271f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/50e7/11789942/ccbc960fb0fa/msae271f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/50e7/11789942/c13816ce4a9c/msae271f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/50e7/11789942/7e1f0775755d/msae271f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/50e7/11789942/ccbc960fb0fa/msae271f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/50e7/11789942/c13816ce4a9c/msae271f3.jpg

相似文献

1
Faithful Interpretation of Protein Structures through Weighted Persistent Homology Improves Evolutionary Distance Estimation.通过加权持久同调对蛋白质结构进行忠实解释可改进进化距离估计。
Mol Biol Evol. 2025 Feb 3;42(2). doi: 10.1093/molbev/msae271.
2
A sequence-based evolutionary distance method for Phylogenetic analysis of highly divergent proteins.一种基于序列的进化距离方法,用于高度分化蛋白的系统发育分析。
Sci Rep. 2023 Nov 20;13(1):20304. doi: 10.1038/s41598-023-47496-9.
3
Use of a database of structural alignments and phylogenetic trees in investigating the relationship between sequence and structural variability among homologous proteins.利用结构比对和系统发育树数据库研究同源蛋白质序列与结构变异性之间的关系。
Protein Eng. 2001 Apr;14(4):219-26. doi: 10.1093/protein/14.4.219.
4
Phylogeny reconstruction: increasing the accuracy of pairwise distance estimation using Bayesian inference of evolutionary rates.系统发育重建:利用进化速率的贝叶斯推断提高成对距离估计的准确性。
Bioinformatics. 2007 Jan 15;23(2):e136-41. doi: 10.1093/bioinformatics/btl304.
5
Persistent homology reveals strong phylogenetic signal in 3D protein structures.持久同调揭示了三维蛋白质结构中强大的系统发育信号。
PNAS Nexus. 2024 Apr 17;3(4):pgae158. doi: 10.1093/pnasnexus/pgae158. eCollection 2024 Apr.
6
Phylogenetic inference with weighted codon evolutionary distances.基于加权密码子进化距离的系统发育推断。
J Mol Evol. 2009 Apr;68(4):377-92. doi: 10.1007/s00239-009-9212-y. Epub 2009 Mar 24.
7
Efficient biased estimation of evolutionary distances when substitution rates vary across sites.当替换率在不同位点变化时进化距离的有效偏差估计。
Mol Biol Evol. 2002 Apr;19(4):534-43. doi: 10.1093/oxfordjournals.molbev.a004109.
8
A stochastic evolutionary model for protein structure alignment and phylogeny.一种用于蛋白质结构比对和系统发生的随机进化模型。
Mol Biol Evol. 2012 Nov;29(11):3575-87. doi: 10.1093/molbev/mss167. Epub 2012 Jun 21.
9
Joint evolutionary trees: a large-scale method to predict protein interfaces based on sequence sampling.联合进化树:一种基于序列采样预测蛋白质界面的大规模方法。
PLoS Comput Biol. 2009 Jan;5(1):e1000267. doi: 10.1371/journal.pcbi.1000267. Epub 2009 Jan 23.
10
Indel reliability in indel-based phylogenetic inference.基于插入缺失的系统发育推断中插入缺失的可靠性。
Genome Biol Evol. 2014 Nov 18;6(12):3199-209. doi: 10.1093/gbe/evu252.

本文引用的文献

1
Accurate structure prediction of biomolecular interactions with AlphaFold 3.利用 AlphaFold 3 进行生物分子相互作用的精确结构预测。
Nature. 2024 Jun;630(8016):493-500. doi: 10.1038/s41586-024-07487-w. Epub 2024 May 8.
2
Persistent homology reveals strong phylogenetic signal in 3D protein structures.持久同调揭示了三维蛋白质结构中强大的系统发育信号。
PNAS Nexus. 2024 Apr 17;3(4):pgae158. doi: 10.1093/pnasnexus/pgae158. eCollection 2024 Apr.
3
The SKMT Algorithm: A method for assessing and comparing underlying protein entanglement.
SKMT 算法:一种评估和比较潜在蛋白质纠缠的方法。
PLoS Comput Biol. 2023 Nov 27;19(11):e1011248. doi: 10.1371/journal.pcbi.1011248. eCollection 2023 Nov.
4
PC_ali: a tool for improved multiple alignments and evolutionary inference based on a hybrid protein sequence and structure similarity score.PC_ali:一种基于混合蛋白质序列和结构相似度得分的改进多重比对和进化推断的工具。
Bioinformatics. 2023 Nov 1;39(11). doi: 10.1093/bioinformatics/btad630.
5
SARS-CoV-2 protein structure and sequence mutations: Evolutionary analysis and effects on virus variants.SARS-CoV-2 蛋白结构和序列突变:进化分析及其对病毒变体的影响。
PLoS One. 2023 Jul 20;18(7):e0283400. doi: 10.1371/journal.pone.0283400. eCollection 2023.
6
Fast and accurate protein structure search with Foldseek.使用 Foldseek 进行快速准确的蛋白质结构搜索。
Nat Biotechnol. 2024 Feb;42(2):243-246. doi: 10.1038/s41587-023-01773-0. Epub 2023 May 8.
7
Evolutionary-scale prediction of atomic-level protein structure with a language model.用语言模型进行原子级蛋白质结构的进化尺度预测。
Science. 2023 Mar 17;379(6637):1123-1130. doi: 10.1126/science.ade2574. Epub 2023 Mar 16.
8
DALI shines a light on remote homologs: One hundred discoveries.DALI 揭示了远程同源物:一百项发现。
Protein Sci. 2023 Jan;32(1):e4519. doi: 10.1002/pro.4519.
9
US-align: universal structure alignments of proteins, nucleic acids, and macromolecular complexes.US-align:蛋白质、核酸和大分子复合物的通用结构比对。
Nat Methods. 2022 Sep;19(9):1109-1115. doi: 10.1038/s41592-022-01585-1. Epub 2022 Aug 29.
10
Fast protein structure comparison through effective representation learning with contrastive graph neural networks.通过对比图神经网络的有效表示学习进行快速蛋白质结构比较。
PLoS Comput Biol. 2022 Mar 24;18(3):e1009986. doi: 10.1371/journal.pcbi.1009986. eCollection 2022 Mar.