Suppr超能文献

用于检测蛋白质编码序列中适应性进化以及识别正选择位点的统计方法的准确性和功效。

Accuracy and power of statistical methods for detecting adaptive evolution in protein coding sequences and for identifying positively selected sites.

作者信息

Wong Wendy S W, Yang Ziheng, Goldman Nick, Nielsen Rasmus

机构信息

Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York 14850, USA.

出版信息

Genetics. 2004 Oct;168(2):1041-51. doi: 10.1534/genetics.104.031153.

Abstract

The parsimony method of Suzuki and Gojobori (1999) and the maximum likelihood method developed from the work of Nielsen and Yang (1998) are two widely used methods for detecting positive selection in homologous protein coding sequences. Both methods consider an excess of nonsynonymous (replacement) substitutions as evidence for positive selection. Previously published simulation studies comparing the performance of the two methods show contradictory results. Here we conduct a more thorough simulation study to cover and extend the parameter space used in previous studies. We also reanalyzed an HLA data set that was previously proposed to cause problems when analyzed using the maximum likelihood method. Our new simulations and a reanalysis of the HLA data demonstrate that the maximum likelihood method has good power and accuracy in detecting positive selection over a wide range of parameter values. Previous studies reporting poor performance of the method appear to be due to numerical problems in the optimization algorithms and did not reflect the true performance of the method. The parsimony method has a very low rate of false positives but very little power for detecting positive selection or identifying positively selected sites.

摘要

铃木和五条博(1999年)提出的简约法以及尼尔森和杨(1998年)研究成果发展而来的最大似然法,是检测同源蛋白质编码序列中正向选择的两种广泛使用的方法。两种方法都将过量的非同义(替换)替换视为正向选择的证据。先前发表的比较这两种方法性能的模拟研究显示出相互矛盾的结果。在此,我们进行了更全面的模拟研究,以涵盖并扩展先前研究中使用的参数空间。我们还重新分析了一个先前提出的使用最大似然法分析时会产生问题的HLA数据集。我们新的模拟以及对HLA数据的重新分析表明,最大似然法在广泛的参数值范围内检测正向选择时具有良好的功效和准确性。先前报告该方法性能不佳的研究似乎是由于优化算法中的数值问题,并未反映该方法的真实性能。简约法的假阳性率非常低,但检测正向选择或识别正向选择位点的能力非常有限。

相似文献

2
A method for detecting positive selection at single amino acid sites.一种检测单个氨基酸位点正选择的方法。
Mol Biol Evol. 1999 Oct;16(10):1315-28. doi: 10.1093/oxfordjournals.molbev.a026042.
4
Statistical properties of the branch-site test of positive selection.分支位点检验的统计特性。
Mol Biol Evol. 2011 Mar;28(3):1217-28. doi: 10.1093/molbev/msq303. Epub 2010 Nov 18.
10
Detecting individual sites subject to episodic diversifying selection.检测易发性分歧选择的个体位点。
PLoS Genet. 2012;8(7):e1002764. doi: 10.1371/journal.pgen.1002764. Epub 2012 Jul 12.

引用本文的文献

本文引用的文献

6
Pervasive adaptive evolution in mammalian fertilization proteins.哺乳动物受精蛋白中的普遍适应性进化。
Mol Biol Evol. 2003 Jan;20(1):18-20. doi: 10.1093/oxfordjournals.molbev.a004233.
8
Tracking adaptive evolutionary events in genomic sequences.追踪基因组序列中的适应性进化事件。
Genome Biol. 2002;3(6):REVIEWS1018. doi: 10.1186/gb-2002-3-6-reviews1018. Epub 2002 May 29.
10
The rapid evolution of reproductive proteins.生殖蛋白的快速进化。
Nat Rev Genet. 2002 Feb;3(2):137-44. doi: 10.1038/nrg733.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验