在基因组序列中寻找达尔文——统计方法的有效性和成功。

Looking for Darwin in genomic sequences--validity and success of statistical methods.

机构信息

Center for Computational Biology and Laboratory of Disease Genomics and Individualized Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100029, China.

出版信息

Mol Biol Evol. 2012 Oct;29(10):2889-93. doi: 10.1093/molbev/mss104. Epub 2012 Apr 3.

DOI:10.1093/molbev/mss104

PMID:22490825

Abstract

The use of codon substitution models to compare synonymous and nonsynonymous substitution rates is a widely used approach to detecting positive Darwinian selection affecting protein evolution. However, in several recent papers, Hughes and colleagues claim that codon-based likelihood-ratio tests (LRTs) are logically flawed as they lack prior hypotheses and fail to accommodate random fluctuations in synonymous and nonsynonymous substitutions Friedman and Hughes (2007) also used site-based LRTs to analyze 605 gene families consisting of human and mouse paralogues. They found that the outcome of the tests was largely determined by irrelevant factors such as the GC content at the third codon positions and the synonymous rate d(S), but not by the nonsynonymous rate d(N) or the d(N)/d(S) ratio, factors that should be related to selection. Here, we reanalyze those data. Contra Friedman and Hughes, we found that the test results are related to sequence length and the average d(N)/d(S) ratio. We examine the criticisms of Hughes and suggest that they are based on misunderstandings of the codon models and on statistical errors. Our analyses suggest that codon-based tests are useful tools for comparative analysis of genomic data sets.

摘要

使用密码子替代模型来比较同义替换率和非同义替换率是一种广泛用于检测影响蛋白质进化的正向达尔文选择的方法。然而，在最近的几篇论文中，Hughes 及其同事声称基于密码子的似然比检验（LRT）在逻辑上存在缺陷，因为它们缺乏先验假设，并且无法适应同义替换和非同义替换的随机波动。Friedman 和 Hughes（2007 年）还使用基于位点的 LRT 分析了由人类和小鼠同源基因组成的 605 个基因家族。他们发现，检验的结果在很大程度上取决于无关因素，如第三密码子位置的 GC 含量和同义替换率 d(S)，而不是非同义替换率 d(N)或 d(N)/d(S) 比值，这些因素应该与选择有关。在这里，我们重新分析了这些数据。与 Friedman 和 Hughes 相反，我们发现检验结果与序列长度和平均 d(N)/d(S) 比值有关。我们检查了 Hughes 的批评，并认为它们基于对密码子模型的误解和统计错误。我们的分析表明，基于密码子的检验是比较基因组数据集的有用工具。

相似文献

Looking for Darwin in genomic sequences--validity and success of statistical methods.在基因组序列中寻找达尔文——统计方法的有效性和成功。

Mol Biol Evol. 2012 Oct;29(10):2889-93. doi: 10.1093/molbev/mss104. Epub 2012 Apr 3.

Statistical properties of the branch-site test of positive selection.分支位点检验的统计特性。

Mol Biol Evol. 2011 Mar;28(3):1217-28. doi: 10.1093/molbev/msq303. Epub 2010 Nov 18.

On the varied pattern of evolution of 2 fungal genomes: a critique of Hughes and Friedman.论两种真菌基因组的多样进化模式：对休斯和弗里德曼观点的批判

Mol Biol Evol. 2006 Dec;23(12):2279-82. doi: 10.1093/molbev/msl122. Epub 2006 Sep 18.

Nucleotide substitution pattern in rice paralogues: implication for negative correlation between the synonymous substitution rate and codon usage bias.水稻旁系同源基因中的核苷酸替代模式：对同义替代率与密码子使用偏好性之间负相关的启示。

Gene. 2006 Jul 19;376(2):199-206. doi: 10.1016/j.gene.2006.03.003. Epub 2006 Mar 18.

Likelihood-ratio tests for positive selection of human and mouse duplicate genes reveal nonconservative and anomalous properties of widely used methods.用于人类和小鼠复制基因正选择的似然比检验揭示了广泛使用方法的非保守性和异常特性。

Mol Phylogenet Evol. 2007 Feb;42(2):388-93. doi: 10.1016/j.ympev.2006.07.015. Epub 2006 Aug 3.

Evolutionary genomics: detecting selection needs comparative data.进化基因组学：检测选择需要比较数据。

Nature. 2005 Jan 20;433(7023):E6; discussion E7-8. doi: 10.1038/nature03222.

Likelihood-based clustering (LiBaC) for codon models, a method for grouping sites according to similarities in the underlying process of evolution.基于似然性的密码子模型聚类（LiBaC），一种根据潜在进化过程中的相似性对位点进行分组的方法。

Mol Biol Evol. 2008 Sep;25(9):1995-2007. doi: 10.1093/molbev/msn145. Epub 2008 Jun 26.

Synonymous substitutions substantially improve evolutionary inference from highly diverged proteins.同义替换显著改善了从高度分化的蛋白质进行的进化推断。

Syst Biol. 2008 Jun;57(3):367-77. doi: 10.1080/10635150802158670.

The genomic rate of adaptive amino acid substitution in Drosophila.果蝇中适应性氨基酸替换的基因组速率。

Mol Biol Evol. 2004 Jul;21(7):1350-60. doi: 10.1093/molbev/msh134. Epub 2004 Mar 24.

The comparative method rules! Codon volatility cannot detect positive Darwinian selection using a single genome sequence.比较方法很棒！密码子变异性无法通过单一基因组序列检测正向达尔文选择。

Mol Biol Evol. 2005 Mar;22(3):496-500. doi: 10.1093/molbev/msi033. Epub 2004 Nov 3.

引用本文的文献

AlexandrusPS: A User-Friendly Pipeline for the Automated Detection of Orthologous Gene Clusters and Subsequent Positive Selection Analysis.亚历山德罗斯 PS：用于自动检测直系同源基因簇和随后的正选择分析的用户友好型流水线。

Genome Biol Evol. 2023 Oct 6;15(10). doi: 10.1093/gbe/evad187.

Complete mitochondrial genomes reveal robust phylogenetic signals and evidence of positive selection in horseshoe bats.完整的线粒体基因组揭示了马蹄蝠中强大的系统发育信号和正选择证据。

BMC Ecol Evol. 2021 Nov 3;21(1):199. doi: 10.1186/s12862-021-01926-2.

Integrated structural and evolutionary analysis reveals common mechanisms underlying adaptive evolution in mammals.综合结构和进化分析揭示了哺乳动物适应性进化的共同机制。

Proc Natl Acad Sci U S A. 2020 Mar 17;117(11):5977-5986. doi: 10.1073/pnas.1916786117. Epub 2020 Mar 2.

Molecular evolution of mammalian genes with epistatic interactions in fertilization.哺乳动物受精过程中具有上位性相互作用的基因的分子进化。

BMC Evol Biol. 2019 Jul 25;19(1):154. doi: 10.1186/s12862-019-1480-6.

Multinucleotide mutations cause false inferences of lineage-specific positive selection.多核苷酸突变导致谱系特异性正选择的错误推断。

Nat Ecol Evol. 2018 Aug;2(8):1280-1288. doi: 10.1038/s41559-018-0584-5. Epub 2018 Jul 2.

Evidence of a Conserved Molecular Response to Selection for Increased Brain Size in Primates.灵长类动物大脑增大选择的保守分子反应证据。

Genome Biol Evol. 2017 Mar 1;9(3):700-713. doi: 10.1093/gbe/evx028.

LMAP: Lightweight Multigene Analyses in PAML.LMAP：PAML中的轻量级多基因分析

BMC Bioinformatics. 2016 Sep 6;17(1):354. doi: 10.1186/s12859-016-1204-5.

Essentiality Is a Strong Determinant of Protein Rates of Evolution during Mutation Accumulation Experiments in Escherichia coli.在大肠杆菌的突变积累实验中，必需性是蛋白质进化速率的一个重要决定因素。

Genome Biol Evol. 2016 Sep 26;8(9):2914-2927. doi: 10.1093/gbe/evw205.

Positive selection on panpulmonate mitogenomes provide new clues on adaptations to terrestrial life.泛肺类线粒体基因组的正向选择为适应陆地生活提供了新线索。

BMC Evol Biol. 2016 Aug 22;16(1):164. doi: 10.1186/s12862-016-0735-8.

Evolutionary dynamics of Rh2 opsins in birds demonstrate an episode of accelerated evolution in the New World warblers (Setophaga).鸟类中Rh2视蛋白的进化动力学表明，新大陆林莺（虫森莺属）经历了一段加速进化期。

Mol Ecol. 2015 May;24(10):2449-62. doi: 10.1111/mec.13180. Epub 2015 Apr 20.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

在基因组序列中寻找达尔文——统计方法的有效性和成功。

Looking for Darwin in genomic sequences--validity and success of statistical methods.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献