
Genome-wide heterogeneity of nucleotide substitution model fit.

Affiliation

Department of Biochemistry, Genetics, and Immunology, University of Vigo, Vigo, Spain.

Publication information

Genome Biol Evol. 2011;3:896-908. doi: 10.1093/gbe/evr080. Epub 2011 Aug 7.

DOI:10.1093/gbe/evr080
PMID:21824869
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3175760/
Abstract

At a genomic scale, the patterns that have shaped molecular evolution are believed to be largely heterogeneous. Consequently, comparative analyses should use appropriate probabilistic substitution models that capture the main features under which different genomic regions have evolved. While efforts have concentrated on the development and understanding of model selection techniques, no descriptions of overall relative substitution model fit at the genome level have been reported. Here, we provide a characterization of best-fit substitution models across three genomic data sets including coding regions from mammals, vertebrates, and Drosophila (24,000 alignments). According to the Akaike Information Criterion (AIC), 82 of 88 models considered were selected as best-fit models on at least one occasion, although with very different frequencies. Most parameter estimates also varied broadly among genes. Patterns found for vertebrates and Drosophila were quite similar and often more complex than those found in mammals. Phylogenetic trees derived from models in the 95% confidence interval set showed much less variance and were significantly closer to the tree estimated under the best-fit model than trees derived from models outside this interval. Although alternative criteria selected simpler models than the AIC, they suggested similar patterns. Altogether, our results show that at a genomic scale, different gene alignments for the same set of taxa are best explained by a large variety of different substitution models and that model choice has implications for different parameter estimates including the inferred phylogenetic trees. After taking into account the differences related to sample size, our results suggest a noticeable diversity in the underlying evolutionary process. Overall, we conclude that the use of model selection techniques is important to obtain consistent phylogenetic estimates from real data at a genomic scale.
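
To make the AIC-based selection described in the abstract concrete, the following is a minimal sketch of how a best-fit model and a 95% confidence set of models might be derived for a single alignment. It assumes the confidence set is built by accumulating Akaike weights in decreasing order, which is a common convention but is not confirmed by the abstract. The model names, log-likelihoods, and parameter counts are hypothetical illustration values, not results from the paper, and the parameter counts ignore branch lengths.

```python
import math


def aic(log_likelihood: float, n_params: int) -> float:
    """Akaike Information Criterion: AIC = 2k - 2 ln L (lower is better)."""
    return 2.0 * n_params - 2.0 * log_likelihood


def akaike_weights(aic_scores: dict[str, float]) -> dict[str, float]:
    """Convert AIC scores into normalized Akaike weights."""
    best = min(aic_scores.values())
    # Relative likelihood of each model given its AIC difference to the best.
    rel = {m: math.exp(-0.5 * (s - best)) for m, s in aic_scores.items()}
    total = sum(rel.values())
    return {m: r / total for m, r in rel.items()}


def confidence_set(weights: dict[str, float], level: float = 0.95) -> list[str]:
    """Smallest set of top-ranked models whose cumulative weight reaches `level`."""
    ranked = sorted(weights.items(), key=lambda kv: kv[1], reverse=True)
    chosen, cumulative = [], 0.0
    for model, weight in ranked:
        chosen.append(model)
        cumulative += weight
        if cumulative >= level:
            break
    return chosen


# Hypothetical fits for one alignment: (log-likelihood, free model parameters).
fits = {
    "JC": (-5230.4, 0),
    "HKY+G": (-5101.7, 5),
    "GTR+I+G": (-5098.9, 10),
}

scores = {model: aic(lnl, k) for model, (lnl, k) in fits.items()}
weights = akaike_weights(scores)
best_model = min(scores, key=scores.get)
conf_set = confidence_set(weights)

print("best-fit model:", best_model)
print("95% confidence set:", conf_set)
```

With these made-up numbers, the most parameter-rich model has the highest likelihood but is not the best-fit model, because the AIC penalizes its extra parameters. Repeating this procedure per gene alignment is what would produce a genome-wide distribution of best-fit models of the kind the abstract summarizes.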




