Bayesian estimation of nonsynonymous/synonymous rate ratios for pairwise sequence comparisons.

Authors

Angelis Konstantinos, Dos Reis Mario, Yang Ziheng

Affiliation

Department of Genetics, Evolution and Environment, University College London, London, United Kingdom.

Publication information

Mol Biol Evol. 2014 Jul;31(7):1902-13. doi: 10.1093/molbev/msu142. Epub 2014 Apr 18.

DOI:10.1093/molbev/msu142

PMID:24748652

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4069626/

Abstract

The nonsynonymous/synonymous rate ratio (ω = dN/dS) is an important measure of the mode and strength of natural selection acting on nonsynonymous mutations in protein-coding genes. The simplest such analysis is the estimation of the dN/dS ratio using two sequences. Both heuristic counting methods and the maximum-likelihood (ML) method based on a codon substitution model are widely used for such analysis. However, these methods do not have nice statistical properties, as the estimates can be zero or infinity in some data sets, so that their means and variances are infinite. In large genome-scale comparisons, such extreme estimates (either 0 or ∞) of ω and sequence distance (t) are common. Here, we implement a Bayesian method to estimate ω and t in pairwise sequence comparisons. Using a combination of computer simulation and real data analysis, we show that the Bayesian estimates have better statistical properties than the ML estimates, because the prior on ω and t shrinks the posterior of those parameters away from extreme values. We also calculate the posterior probability for ω > 1 as a Bayesian alternative to the likelihood ratio test. The new method is computationally efficient and may be useful for genome-scale comparisons of protein-coding gene sequences.
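The shrinkage effect described in the abstract can be illustrated with a toy calculation. The sketch below is not the authors' implementation and does not use a codon substitution model: it assumes a deliberately simplified Poisson model for the counts of synonymous and nonsynonymous differences, gamma priors with hypothetical parameters, and a plain grid approximation of the posterior. All function names and default values are illustrative assumptions.

```python
import math

def log_gamma_pdf(x, shape, rate):
    """Log density of a Gamma(shape, rate) distribution at x > 0."""
    return (shape * math.log(rate) - math.lgamma(shape)
            + (shape - 1) * math.log(x) - rate * x)

def log_poisson(k, mean):
    """Log probability of observing k events under Poisson(mean)."""
    return k * math.log(mean) - mean - math.lgamma(k + 1)

def log_grid(low, high, size):
    """Points evenly spaced on the log scale between low and high."""
    step = (math.log(high) - math.log(low)) / (size - 1)
    return [math.exp(math.log(low) + i * step) for i in range(size)]

def ml_omega(x_n, x_s, n_sites, s_sites):
    """Counting-style estimate of omega; can be 0 or infinite."""
    d_n, d_s = x_n / n_sites, x_s / s_sites
    if d_s == 0:
        return math.inf if d_n > 0 else math.nan
    return d_n / d_s

def posterior_omega(x_n, x_s, n_sites, s_sites,
                    omega_prior=(1.1, 2.2), ds_prior=(1.1, 1.1), grid=200):
    """Posterior mean of omega and P(omega > 1) under the toy model.

    Assumed toy model (NOT the codon model of the paper):
      x_s ~ Poisson(s_sites * d_s), x_n ~ Poisson(n_sites * omega * d_s).
    The prior parameters are hypothetical (shape, rate) pairs.
    """
    cells = []
    for omega in log_grid(1e-4, 1e2, grid):
        for d_s in log_grid(1e-4, 1e1, grid):
            log_post = (log_gamma_pdf(omega, *omega_prior)
                        + log_gamma_pdf(d_s, *ds_prior)
                        + log_poisson(x_s, s_sites * d_s)
                        + log_poisson(x_n, n_sites * omega * d_s)
                        # Jacobian of the log-spaced grid
                        + math.log(omega) + math.log(d_s))
            cells.append((omega, log_post))
    peak = max(lp for _, lp in cells)  # subtract the peak to avoid underflow
    weights = [(omega, math.exp(lp - peak)) for omega, lp in cells]
    total = sum(wt for _, wt in weights)
    mean = sum(omega * wt for omega, wt in weights) / total
    prob_positive = sum(wt for omega, wt in weights if omega > 1) / total
    return mean, prob_positive

if __name__ == "__main__":
    # No synonymous differences: the counting estimate is infinite.
    print(ml_omega(3, 0, 300, 100))
    print(posterior_omega(3, 0, 300, 100))
    # No nonsynonymous differences: the counting estimate is zero.
    print(ml_omega(0, 5, 300, 100))
    print(posterior_omega(0, 5, 300, 100))
```

In both extreme cases the counting estimate sits on the boundary (∞ or 0), while the posterior mean stays finite and positive because the prior pulls it away from the boundary. The second returned value plays the role of the posterior probability for ω > 1 mentioned in the abstract, here only under the toy model.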

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5905/4069626/f856347890ed/msu142f1p.jpg
