GTRpmix: A Linked General Time-Reversible Model for Profile Mixture Models.

Affiliations

Department of Mathematics, California State University San Bernardino, San Bernardino, CA, USA.

Department of Biochemistry and Molecular Biology, Faculty of Medicine, Dalhousie University, Halifax, NS, Canada.

Publication information

Mol Biol Evol. 2024 Sep 4;41(9). doi: 10.1093/molbev/msae174.

DOI:10.1093/molbev/msae174
PMID:39158305
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11371462/
Abstract

Profile mixture models capture distinct biochemical constraints on the amino acid substitution process at different sites in proteins. These models feature a mixture of time-reversible models with a common matrix of exchangeabilities and distinct sets of equilibrium amino acid frequencies known as profiles. Combining the exchangeability matrix with each profile generates the matrix of instantaneous rates of amino acid exchange for that profile. Currently, empirically estimated exchangeability matrices (e.g. the LG matrix) are widely used for phylogenetic inference under profile mixture models. However, these were estimated using a single profile and are unlikely to be optimal for profile mixture models. Here, we describe the GTRpmix model, which allows maximum likelihood estimation of a common exchangeability matrix under any profile mixture model. We show that exchangeability matrices estimated under profile mixture models differ from the LG matrix and dramatically improve model fit and topological estimation accuracy for empirical test cases. Because the GTRpmix model is computationally expensive, we provide two exchangeability matrices estimated from large concatenated phylogenomic supermatrices to be used for phylogenetic analyses. One, called Eukaryotic Linked Mixture (ELM), is designed for phylogenetic analysis of proteins encoded by nuclear genomes of eukaryotes, and the other, Eukaryotic and Archaeal Linked Mixture (EAL), for reconstructing relationships between eukaryotes and Archaea. These matrices, combined with profile mixture models, fit data better and have improved topology estimation relative to the LG matrix combined with the same mixture models. Starting with version 2.3.1, IQ-TREE2 allows users to estimate linked exchangeabilities (i.e. amino acid exchange rates) under profile mixture models.
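The step of combining one shared exchangeability matrix with each profile can be illustrated with a minimal sketch. It assumes the usual general time-reversible parameterization, Q[i][j] = s[i][j] * pi[j] for i != j, with each matrix rescaled to one expected substitution per unit time; the abstract does not state the normalization, so treat that as an assumption. The 3-state alphabet, the values in `S` and `PROFILES`, and the function name `rate_matrix` are all hypothetical and chosen only to keep the example small (real amino acid models use 20 states).

```python
def rate_matrix(exchangeabilities, profile):
    """Build a reversible rate matrix from shared exchangeabilities and one profile.

    Off-diagonal entries are s[i][j] * pi[j]; each diagonal entry makes its row
    sum to zero. The matrix is then rescaled so the expected number of
    substitutions per unit time is 1 (an assumed, commonly used convention).
    """
    n = len(profile)
    q = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:
                q[i][j] = exchangeabilities[i][j] * profile[j]
        # The diagonal is still 0.0 here, so this sums the off-diagonal entries.
        q[i][i] = -sum(q[i])
    # Expected substitution rate at equilibrium: -sum_i pi[i] * Q[i][i].
    rate = -sum(profile[i] * q[i][i] for i in range(n))
    return [[x / rate for x in row] for row in q]


# Hypothetical symmetric exchangeabilities shared ("linked") across all classes.
S = [
    [0.0, 1.0, 2.0],
    [1.0, 0.0, 3.0],
    [2.0, 3.0, 0.0],
]

# Two hypothetical profiles (equilibrium frequency vectors), each summing to 1.
PROFILES = [
    [0.5, 0.3, 0.2],
    [0.1, 0.1, 0.8],
]

# One rate matrix per mixture class, all built from the same S.
Q_LIST = [rate_matrix(S, pi) for pi in PROFILES]

if __name__ == "__main__":
    for k, q in enumerate(Q_LIST):
        print(f"class {k}:")
        for row in q:
            print("  " + "  ".join(f"{x: .4f}" for x in row))
```

Because `S` is symmetric, each resulting matrix satisfies detailed balance (pi[i] * Q[i][j] == pi[j] * Q[j][i]), which is what makes every mixture class time-reversible while the exchangeabilities stay common to all classes.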

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f3b1/11371462/da07d48b795e/msae174f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f3b1/11371462/1a94074b91e3/msae174f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f3b1/11371462/1e86f0e32952/msae174f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f3b1/11371462/bbb541a9e898/msae174f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f3b1/11371462/7b196a942b02/msae174f4.jpg
