• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于简约法估计的位点间替换率分布的核苷酸序列间的进化距离。

Evolutionary distances between nucleotide sequences based on the distribution of substitution rates among sites as estimated by parsimony.

作者信息

Tourasse N J, Gouy M

机构信息

Laboratorie de Biométrie, Génétique et Biologie des Populations, Université Claude Bernard, France.

出版信息

Mol Biol Evol. 1997 Mar;14(3):287-98. doi: 10.1093/oxfordjournals.molbev.a025764.

DOI:10.1093/oxfordjournals.molbev.a025764
PMID:9066796
Abstract

The rate of evolution of macromolecules such as ribosomal RNAs and proteins varies along the molecule because structural and functional constraints differ between sites. Many studies have shown that ignoring this variation in computing evolutionary distances leads to severe underestimation of sequence divergences, and thus can lead to misleading evolutionary tree inferences. We propose here a new parsimony-based method for computing evolutionary distances between pairs of sequences that takes into account this variation and estimates it from the data. This method applies to the number of substitutions per site in ribosomal RNA genes as well as to the number of nonsynonymous substitutions per codon for protein-coding genes and is especially suitable when large data sets (> or = 100 sequences) are analyzed. First, starting from a phylogeny constructed with usual distances, the maximum-parsimony method is used to infer the distribution of the number of substitutions that have occurred at each site (or codon) along this tree. This distribution is then fitted to an "invariant + truncated negative binomial" distribution that allows for invariant sites. Maximum-likelihood fitting of this distribution to different data sets showed that it agreed very well with real data. Noticeably, allowing for invariant sites seemed to be very important. Finally, two distance estimates were developed by introducing the distribution of site variability into the substitution models of Jukes and Cantor and of Kimura. The use of different numbers of aligned sequences (up to 1,000 rRNA sequences) showed that the parameters of the model are very sensitive to the number of sequences used to estimate them. However, if at least 100 sequences are considered, the two new distance estimates are quite stable with respect to the number of sequences used to fit the distribution. This stability is true for low as well as for high evolutionary distances. These new distances appeared to be much better estimates of the number of substitutions per site than the classical distances of Jukes and Cantor and of Kimura, which both greatly underestimate this number, so that they can serve as indexes to detect saturation. We conclude that the new distances are particularly suitable for phylogenetic analysis when very distantly related species and relatively large data sets are considered. Trees reconstructed using these distances are generally different from those constructed by means of the classical estimates. Using this new method, we showed that the mean evolutionary distance between Prokaryotes and Eukaryotes is substantially higher for the small-subunit than for the large-subunit rRNAs. This suggests than the former might have experienced a drastic change during the early evolution of Eukaryotes.

摘要

核糖体RNA和蛋白质等大分子的进化速率在分子上各部位有所不同,因为各部位的结构和功能限制存在差异。许多研究表明,在计算进化距离时忽略这种差异会导致对序列分歧严重低估,进而可能得出误导性的进化树推断。我们在此提出一种基于简约法的新方法,用于计算序列对之间的进化距离,该方法考虑了这种差异并从数据中对其进行估计。此方法适用于核糖体RNA基因中每个位点的替换数,以及蛋白质编码基因中每个密码子的非同义替换数,尤其适用于分析大数据集(≥100个序列)时。首先,从用常规距离构建的系统发育树出发,使用最大简约法推断沿此树每个位点(或密码子)发生的替换数的分布。然后将该分布拟合为允许存在不变位点的“不变 + 截断负二项式”分布。对不同数据集进行该分布的最大似然拟合表明,它与实际数据吻合得很好。值得注意的是,允许存在不变位点似乎非常重要。最后,通过将位点变异性分布引入Jukes和Cantor以及Kimura的替换模型,得出了两种距离估计值。使用不同数量的比对序列(多达1000个rRNA序列)表明,模型参数对用于估计它们的序列数量非常敏感。然而,如果考虑至少100个序列,这两种新的距离估计值对于用于拟合分布的序列数量相当稳定。无论是低进化距离还是高进化距离,这种稳定性都成立。这些新距离似乎比Jukes和Cantor以及Kimura的经典距离更能准确估计每个位点的替换数,后两者都极大地低估了这个数字,因此它们可作为检测饱和度的指标。我们得出结论,当考虑亲缘关系非常远的物种和相对较大的数据集时,新距离特别适合系统发育分析。使用这些距离重建的树通常与通过经典估计构建的树不同。使用这种新方法,我们表明原核生物和真核生物之间小亚基的平均进化距离比大亚基rRNA的平均进化距离高得多。这表明前者在真核生物早期进化过程中可能经历了剧烈变化。

相似文献

1
Evolutionary distances between nucleotide sequences based on the distribution of substitution rates among sites as estimated by parsimony.基于简约法估计的位点间替换率分布的核苷酸序列间的进化距离。
Mol Biol Evol. 1997 Mar;14(3):287-98. doi: 10.1093/oxfordjournals.molbev.a025764.
2
Calculating the probability of multitaxon evolutionary trees: bootstrappers Gambit.计算多分类单元进化树的概率:自展策略
Proc Natl Acad Sci U S A. 1995 Oct 10;92(21):9662-6. doi: 10.1073/pnas.92.21.9662.
3
Reconstructing evolution from eukaryotic small-ribosomal-subunit RNA sequences: calibration of the molecular clock.从真核生物小核糖体亚基RNA序列重建进化:分子钟的校准
J Mol Evol. 1993 Aug;37(2):221-32. doi: 10.1007/BF02407359.
4
Empirical models for substitution in ribosomal RNA.核糖体RNA中替代的经验模型。
Mol Biol Evol. 2004 Mar;21(3):419-27. doi: 10.1093/molbev/msh029. Epub 2003 Dec 5.
5
Relative efficiencies of the maximum-likelihood, neighbor-joining, and maximum-parsimony methods when substitution rate varies with site.当替换率随位点变化时,最大似然法、邻接法和最大简约法的相对效率。
Mol Biol Evol. 1994 Mar;11(2):261-77. doi: 10.1093/oxfordjournals.molbev.a040108.
6
An updated and comprehensive rRNA phylogeny of (crown) eukaryotes based on rate-calibrated evolutionary distances.基于速率校准进化距离的(冠)真核生物更新且全面的核糖体RNA系统发育学
J Mol Evol. 2000 Dec;51(6):565-76. doi: 10.1007/s002390010120.
7
When is it safe to use an oversimplified substitution model in tree-making?在构建树状图时,何时使用过于简化的替代模型是安全的?
Mol Biol Evol. 1996 Nov;13(9):1255-65. doi: 10.1093/oxfordjournals.molbev.a025691.
8
Relative efficiencies of the maximum parsimony and distance-matrix methods in obtaining the correct phylogenetic tree.最大简约法和距离矩阵法在获得正确系统发育树方面的相对效率。
Mol Biol Evol. 1988 May;5(3):298-311. doi: 10.1093/oxfordjournals.molbev.a040497.
9
Phylogeny of some ascaridoid nematodes, inferred from comparison of 18S and 28S rRNA sequences.基于18S和28S rRNA序列比较推断的某些蛔形线虫的系统发育
Mol Biol Evol. 1992 Sep;9(5):932-44. doi: 10.1093/oxfordjournals.molbev.a040768.
10
Tests of applicability of several substitution models for DNA sequence data.几种DNA序列数据替代模型的适用性测试。
Mol Biol Evol. 1995 Jan;12(1):131-51. doi: 10.1093/oxfordjournals.molbev.a040182.

引用本文的文献

1
Uncovering the Horseshoe Effect in Microbial Analyses.揭示微生物分析中的马蹄铁效应。
mSystems. 2017 Feb 21;2(1). doi: 10.1128/mSystems.00166-16. eCollection 2017 Jan-Feb.
2
The impact of modelling rate heterogeneity among sites on phylogenetic estimates of intraspecific evolutionary rates and timescales.对种内进化率和时间尺度的系统发育估计中,模型站点间变异率异质性的影响。
PLoS One. 2014 May 5;9(5):e95722. doi: 10.1371/journal.pone.0095722. eCollection 2014.
3
Long branch effects distort maximum likelihood phylogenies in simulations despite selection of the correct model.
尽管选择了正确的模型,长枝效应仍会在模拟中扭曲最大似然系统发育。
PLoS One. 2012;7(5):e36593. doi: 10.1371/journal.pone.0036593. Epub 2012 May 9.
4
Comparisons of mutation rate variation at genome-wide microsatellites: evolutionary insights from two cultivated rice and their wild relatives.全基因组微卫星突变率变异的比较:来自两种栽培稻及其野生近缘种的进化见解
BMC Evol Biol. 2008 Jan 16;8:11. doi: 10.1186/1471-2148-8-11.
5
Mutation rate variation at human dinucleotide microsatellites.人类双核苷酸微卫星的突变率变异
Genetics. 2005 May;170(1):305-12. doi: 10.1534/genetics.104.036855. Epub 2005 Feb 16.
6
Estimation of evolutionary distances under stationary and nonstationary models of nucleotide substitution.核苷酸替换的平稳和非平稳模型下进化距离的估计。
Proc Natl Acad Sci U S A. 1998 May 26;95(11):5899-905. doi: 10.1073/pnas.95.11.5899.