分支位点检验的统计特性。

Statistical properties of the branch-site test of positive selection.

机构信息

Department of Genetics, Evolution and Environment, University College London, United Kingdom.

出版信息

Mol Biol Evol. 2011 Mar;28(3):1217-28. doi: 10.1093/molbev/msq303. Epub 2010 Nov 18.

Abstract

The branch-site test is a likelihood ratio test to detect positive selection along prespecified lineages on a phylogeny that affects only a subset of codons in a protein-coding gene, with positive selection indicated by accelerated nonsynonymous substitutions (with ω = d(N)/d(S) > 1). This test may have more power than earlier methods, which average nucleotide substitution rates over sites in the protein and/or over branches on the tree. However, a few recent studies questioned the statistical basis of the test and claimed that the test generated too many false positives. In this paper, we examine the null distribution of the test and conduct a computer simulation to examine the false-positive rate and the power of the test. The results suggest that the asymptotic theory is reliable for typical data sets, and indeed in our simulations, the large-sample null distribution was reliable with as few as 20-50 codons in the alignment. We examined the impact of sequence length, the strength of positive selection, and the proportion of sites under positive selection on the power of the branch-site test. We found that the test was far more powerful in detecting episodic positive selection than branch-based tests, which average substitution rates over all codons in the gene and thus miss the signal when most codons are under strong selective constraint. Recent claims of statistical problems with the branch-site test are due to misinterpretations of simulation results. Our results, as well as previous simulation studies that have demonstrated the robustness of the test, suggest that the branch-site test may be a useful tool for detecting episodic positive selection and for generating biological hypotheses for mutation studies and functional analyses. The test is sensitive to sequence and alignment errors and caution should be exercised concerning its use when data quality is in doubt.

摘要

分支位点检验是一种似然比检验，用于检测系统发育树上特定谱系中影响蛋白质编码基因中部分密码子的正选择，正选择表现为加速非同义替换（ω=d（N）/d（S）>1）。与在蛋白质中对位点或在树的分支上平均核苷酸替换率的早期方法相比，该检验可能具有更高的功效。然而，最近的一些研究质疑了该检验的统计基础，并声称该检验产生了过多的假阳性。在本文中，我们检验了检验的零假设分布，并进行了计算机模拟，以检验假阳性率和检验的功效。结果表明，渐近理论对于典型数据集是可靠的，实际上，在我们的模拟中，大样本零假设分布在比对中具有 20-50 个密码子时就可靠。我们研究了序列长度、正选择的强度以及受正选择影响的位点比例对分支位点检验功效的影响。我们发现，与基于分支的检验相比，该检验在检测突发性正选择方面具有更强的功效，因为基于分支的检验平均了基因中所有密码子的替换率，从而忽略了大多数密码子受到强烈选择约束时的信号。最近关于分支位点检验统计问题的说法是由于对模拟结果的误解所致。我们的结果以及之前的模拟研究已经证明了该检验的稳健性，表明分支位点检验可能是检测突发性正选择和为突变研究和功能分析生成生物学假设的有用工具。该检验对序列和比对错误很敏感，当数据质量存在疑问时，应谨慎使用该检验。

相似文献

Statistical properties of the branch-site test of positive selection.分支位点检验的统计特性。

Mol Biol Evol. 2011 Mar;28(3):1217-28. doi: 10.1093/molbev/msq303. Epub 2010 Nov 18.

The effect of insertions, deletions, and alignment errors on the branch-site test of positive selection.插入、缺失和比对错误对正选择分支位点检验的影响。

Mol Biol Evol. 2010 Oct;27(10):2257-67. doi: 10.1093/molbev/msq115. Epub 2010 May 5.

Multiple hypothesis testing to detect lineages under positive selection that affects only a few sites.多重假设检验以检测仅影响少数位点的正选择谱系。

Mol Biol Evol. 2007 May;24(5):1219-28. doi: 10.1093/molbev/msm042. Epub 2007 Mar 5.

Frequent false detection of positive selection by the likelihood method with branch-site models.使用分支位点模型的似然法经常错误检测正选择。

Mol Biol Evol. 2004 Jul;21(7):1332-9. doi: 10.1093/molbev/msh117. Epub 2004 Mar 10.

Positive Darwinian selection at the pantophysin (Pan I) locus in marine gadid fishes.海洋鳕科鱼类泛ophysin（Pan I）基因座的正达尔文选择。

Mol Biol Evol. 2004 Jan;21(1):65-75. doi: 10.1093/molbev/msg237. Epub 2003 Aug 29.

Likelihood-based clustering (LiBaC) for codon models, a method for grouping sites according to similarities in the underlying process of evolution.基于似然性的密码子模型聚类（LiBaC），一种根据潜在进化过程中的相似性对位点进行分组的方法。

Mol Biol Evol. 2008 Sep;25(9):1995-2007. doi: 10.1093/molbev/msn145. Epub 2008 Jun 26.

Robust inference of positive selection from recombining coding sequences.从重组编码序列中进行正向选择的稳健推断。

Bioinformatics. 2006 Oct 15;22(20):2493-9. doi: 10.1093/bioinformatics/btl427. Epub 2006 Aug 7.

Evaluation of an improved branch-site likelihood method for detecting positive selection at the molecular level.一种用于在分子水平检测正选择的改进型分支位点似然法的评估

Mol Biol Evol. 2005 Dec;22(12):2472-9. doi: 10.1093/molbev/msi237. Epub 2005 Aug 17.

Proceedings of the SMBE Tri-National Young Investigators' Workshop 2005. Control of the false discovery rate applied to the detection of positively selected amino acid sites.2005年 SMBE 三国青年研究者研讨会会议记录。应用于检测正选择氨基酸位点的错误发现率控制

Mol Biol Evol. 2006 May;23(5):919-26. doi: 10.1093/molbev/msj095. Epub 2006 Jan 19.

Large-scale analyses of synonymous substitution rates can be sensitive to assumptions about the process of mutation.同义替换率的大规模分析可能对有关突变过程的假设敏感。

Gene. 2006 Aug 15;378:58-64. doi: 10.1016/j.gene.2006.04.024. Epub 2006 May 22.

引用本文的文献

The Evolutionary History and Modern Diversity of Triterpenoid Cyclases.三萜环化酶的进化史与现代多样性

Mol Biol Evol. 2025 Sep 1;42(9). doi: 10.1093/molbev/msaf203.

The evolutionary history and modern diversity of triterpenoid cyclases.三萜环化酶的进化史与现代多样性

bioRxiv. 2025 Aug 2:2024.10.28.620730. doi: 10.1101/2024.10.28.620730.

Insights into Treponema pallidum genomics from modern and ancient genomes using a novel mapping strategy.使用一种新颖的定位策略，从现代和古代基因组中洞察梅毒螺旋体基因组学。

BMC Biol. 2025 Jan 8;23(1):7. doi: 10.1186/s12915-024-02108-4.

Comparative chloroplast genome analysis of Ardisia (Myrsinoideae, Primulaceae) in China and implications for phylogenetic relationships and adaptive evolution.中国紫金牛属（报春花科，紫金牛亚科）叶绿体基因组比较分析及其对系统发育关系和适应性进化的意义

BMC Plant Biol. 2024 Dec 19;24(1):1198. doi: 10.1186/s12870-024-05892-x.

Plastome structure and phylogenetic relationships of genus Hydrocotyle (apiales): provide insights into the plastome evolution of Hydrocotyle.植物基因组结构与Hydrocotyle 属（伞形目）的系统发育关系：为 Hydrocotyle 植物基因组进化提供了新见解。

BMC Plant Biol. 2024 Aug 15;24(1):778. doi: 10.1186/s12870-024-05483-w.

Dissecting positive selection events and immunological drives during the evolution of adeno-associated virus lineages.解析腺相关病毒谱系进化过程中的正选择事件和免疫驱动因素。

PLoS Pathog. 2024 Jun 17;20(6):e1012260. doi: 10.1371/journal.ppat.1012260. eCollection 2024 Jun.

Structural basis of neuropeptide Y signaling through Y and Y receptors.神经肽Y通过Y受体和Y₂受体信号传导的结构基础。

MedComm (2020). 2024 Jun 15;5(7):e565. doi: 10.1002/mco2.565. eCollection 2024 Jul.

Analysis of Evolutionary Conservation, Expression Level, and Genetic Association at a Genome-wide Scale Reveals Heterogeneity Across Polygenic Phenotypes.在全基因组范围内分析进化保守性、表达水平和遗传关联揭示了多基因表型的异质性。

Mol Biol Evol. 2024 Jul 3;41(7). doi: 10.1093/molbev/msae115.

Unprecedented variation pattern of plastid genomes and the potential role in adaptive evolution in Poales.植物中叶绿体基因组的空前变异模式及其在禾本科植物适应进化中的潜在作用。

BMC Biol. 2024 Apr 29;22(1):97. doi: 10.1186/s12915-024-01890-5.

Genetic mechanism of body size variation in groupers: Insights from phylotranscriptomics.石斑鱼体型变异的遗传机制：系统转录组学的启示。

Zool Res. 2024 Mar 18;45(2):314-328. doi: 10.24272/j.issn.2095-8137.2023.222.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

分支位点检验的统计特性。

Statistical properties of the branch-site test of positive selection.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献