• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在蛋白质系统发生学中考虑溶剂可及性和二级结构显然是有益的。

Accounting for solvent accessibility and secondary structure in protein phylogenetics is clearly beneficial.

机构信息

Méthodes et Algorithmes pour la Bioinformatique, LIRMM, CNRS-Université Montpellier II, 161 rue Ada, Montpellier Cedex 5, France.

出版信息

Syst Biol. 2010 May;59(3):277-87. doi: 10.1093/sysbio/syq002. Epub 2010 Mar 10.

DOI:10.1093/sysbio/syq002
PMID:20525635
Abstract

Amino acid substitution models are essential to most methods to infer phylogenies from protein data. These models represent the ways in which proteins evolve and substitutions accumulate along the course of time. It is widely accepted that the substitution processes vary depending on the structural configuration of the protein residues. However, this information is very rarely used in phylogenetic studies, though the 3-dimensional structure of dozens of thousands of proteins has been elucidated. Here, we reinvestigate the question in order to fill this gap. We use an improved estimation methodology and a very large database comprising 1471 nonredundant globular protein alignments with structural annotations to estimate new amino acid substitution models accounting for the secondary structure and solvent accessibility of the residues. These models incorporate a confidence coefficient that is estimated from the data and reflects the reliability and usefulness of structural annotations in the analyzed sequences. Our results with 300 independent test alignments show an impressive likelihood gain compared with standard models such as JTT or WAG. Moreover, the use of these models induces significant topological changes in the inferred trees, which should be of primary interest to phylogeneticists. Our data, models, and software are available for download from http://atgc.lirmm.fr/phyml-structure/.

摘要

氨基酸替代模型对于大多数从蛋白质数据推断系统发育的方法来说都是必不可少的。这些模型代表了蛋白质随时间进化和替代积累的方式。人们普遍认为,替代过程取决于蛋白质残基的结构构象。然而,尽管已经阐明了数万个蛋白质的三维结构,但这种信息在系统发育研究中很少被使用。在这里,我们重新研究了这个问题,以填补这一空白。我们使用改进的估计方法和一个包含 1471 个具有结构注释的非冗余球状蛋白质比对的大型数据库,来估计新的氨基酸替代模型,这些模型考虑了残基的二级结构和溶剂可及性。这些模型包含一个置信系数,该系数是根据数据估计的,反映了分析序列中结构注释的可靠性和有用性。我们使用 300 个独立的测试比对的结果表明,与 JTT 或 WAG 等标准模型相比,我们的模型具有令人印象深刻的可能性增益。此外,在推断的树中,这些模型的使用会引起显著的拓扑变化,这应该是系统发育学家的主要关注点。我们的数据、模型和软件可从 http://atgc.lirmm.fr/phyml-structure/ 下载。

相似文献

1
Accounting for solvent accessibility and secondary structure in protein phylogenetics is clearly beneficial.在蛋白质系统发生学中考虑溶剂可及性和二级结构显然是有益的。
Syst Biol. 2010 May;59(3):277-87. doi: 10.1093/sysbio/syq002. Epub 2010 Mar 10.
2
Empirical models for substitution in ribosomal RNA.核糖体RNA中替代的经验模型。
Mol Biol Evol. 2004 Mar;21(3):419-27. doi: 10.1093/molbev/msh029. Epub 2003 Dec 5.
3
An improved general amino acid replacement matrix.一种改进的通用氨基酸置换矩阵。
Mol Biol Evol. 2008 Jul;25(7):1307-20. doi: 10.1093/molbev/msn067. Epub 2008 Mar 26.
4
Phylogenetic mixture models for proteins.蛋白质的系统发育混合模型
Philos Trans R Soc Lond B Biol Sci. 2008 Dec 27;363(1512):3965-76. doi: 10.1098/rstb.2008.0180.
5
On the quality of tree-based protein classification.论基于树的蛋白质分类的质量。
Bioinformatics. 2005 May 1;21(9):1876-90. doi: 10.1093/bioinformatics/bti244. Epub 2005 Jan 12.
6
Modeling protein evolution with several amino acid replacement matrices depending on site rates.基于位点速率的几种氨基酸替换矩阵来模拟蛋白质进化。
Mol Biol Evol. 2012 Oct;29(10):2921-36. doi: 10.1093/molbev/mss112. Epub 2012 Apr 6.
7
Estimating maximum likelihood phylogenies with PhyML.使用PhyML估计最大似然系统发育树。
Methods Mol Biol. 2009;537:113-37. doi: 10.1007/978-1-59745-251-9_6.
8
Species trees from highly incongruent gene trees in rice.在水稻中,来自高度不一致基因树的种系发生树。
Syst Biol. 2009 Oct;58(5):489-500. doi: 10.1093/sysbio/syp054. Epub 2009 Sep 21.
9
Three-dimensional window analysis for detecting positive selection at structural regions of proteins.用于检测蛋白质结构区域正选择的三维窗口分析
Mol Biol Evol. 2004 Dec;21(12):2352-9. doi: 10.1093/molbev/msh249. Epub 2004 Sep 8.
10
Phylogenetic inference from conserved sites alignments.基于保守位点比对的系统发育推断。
J Exp Zool. 1999 Aug 15;285(2):128-39.

引用本文的文献

1
Protein Structural Phylogenetics.蛋白质结构系统发育学
Genome Biol Evol. 2025 Jul 30;17(8). doi: 10.1093/gbe/evaf139.
2
The Evolutionary History and Modern Diversity of Triterpenoid Cyclases.三萜环化酶的进化史与现代多样性
Mol Biol Evol. 2025 Sep 1;42(9). doi: 10.1093/molbev/msaf203.
3
The evolutionary history and modern diversity of triterpenoid cyclases.三萜环化酶的进化史与现代多样性
bioRxiv. 2025 Aug 2:2024.10.28.620730. doi: 10.1101/2024.10.28.620730.
4
nT4X and nT4M: Novel Time Non-reversible Mixture Amino Acid Substitution Models.nT4X和nT4M:新型时间不可逆混合氨基酸取代模型。
J Mol Evol. 2025 Feb;93(1):136-148. doi: 10.1007/s00239-024-10230-8. Epub 2025 Jan 20.
5
PsiPartition: Improved Site Partitioning for Genomic Data by Parameterized Sorting Indices and Bayesian Optimization.PsiPartition:通过参数化排序索引和贝叶斯优化改进基因组数据的位点划分
J Mol Evol. 2024 Dec;92(6):874-890. doi: 10.1007/s00239-024-10215-7. Epub 2024 Dec 5.
6
Ancestral Reconstruction and the Evolution of Protein Energy Landscapes.祖先重建与蛋白质能量景观的演化。
Annu Rev Biophys. 2024 Jul;53(1):127-146. doi: 10.1146/annurev-biophys-030722-125440. Epub 2024 Jun 28.
7
Is Over-parameterization a Problem for Profile Mixture Models?过参数化对轮廓混合模型是一个问题吗?
Syst Biol. 2024 May 27;73(1):53-75. doi: 10.1093/sysbio/syad063.
8
Horizontal transfer of vertebrate vision gene into the chordate ancestor.脊椎动物视觉基因向脊索动物祖先的水平转移。
Proc Natl Acad Sci U S A. 2023 Aug 22;120(34):e2310390120. doi: 10.1073/pnas.2310390120. Epub 2023 Aug 14.
9
Asymmetric dimerization in a transcription factor superfamily is promoted by allosteric interactions with DNA.转录因子超家族中的不对称二聚化是由与 DNA 的变构相互作用促进的。
Nucleic Acids Res. 2023 Sep 8;51(16):8864-8879. doi: 10.1093/nar/gkad632.
10
Independent rediploidization masks shared whole genome duplication in the sturgeon-paddlefish ancestor.独立的二倍体化掩盖了鲟鱼-匙吻鲟祖先中的全基因组重复。
Nat Commun. 2023 May 19;14(1):2879. doi: 10.1038/s41467-023-38714-z.